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(57) Abstract: The invention relates to the field of biology. In particular, the invention relates to a method and system for designing 
Q optimized mulli-epitope vaccines having selected combinations of spacer nucleic acid at the junctions of the multi-epitope constructs 

encoding a plurality of CTL and/or HTL eoutioc oeotudes so as to minimize the number of junciontal epitopes and provide vaccines 
^* with increased immunogenicity. 
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METHOD AND SYSTEM FOR OPTIMIZING MULTI-EPITOPE NUCLEIC ACID 
CONSTRUCTS AND PEPTIDES ENCODED THEREBY 

BACKGROUND 

This present invention relates to the field of biology. In particular, it relates to 
multi-epitope nucleic acid vaccines and methods of designing such vaccines to provide 
increased immunogenicity. 

The technology relevant to multi-epitope ("minigene" or "EpiGene™") vaccines is 
developing. Several independent studies have established that induction of simultaneous 
immune responses against multiple epitopes can be achieved. For example, responses 
against a large number of T cell specificities can be induced and detected. In natural 
situations, Doolan et al (Immunity, Vol. 7(1):97-1 12 (1997)) simultaneously detected recall 
T cell responses, against as many as 17 different P. falciparum epitopes using PBMC from 
a single donor. Similarly, Bertoni and colleagues (J Clin Invest, Vol 100(3):503-13 
(1997)) detected simultaneous CTL responses against 12 different HBV-derived epitopes in 
a single donor. In tenns of immunization with multi-epitope nucleic acid vaccines, several 
examples have been reported where multiple T cell responses were induced. For example, 
minigene vaccines composed of approximately ten MHC Class I epitopes in which all 
epitopes were immunogenic and/or antigenic have been reported. Specifically, minigene 
vaccines composed of 9 EBV (Thomson et al., Proc Natl Acad Sci USA, Vol. 
92(13):5845-9 (1995)), 7 HIV (Woodberry et al., J Virol, Vol. 73(7):5320-5 (1999)), 10 
murine (Thomson et al., J Immunol, Vol. 160(4): 17 17-23 (1998)) and 10 tumor-derived 
(Mateo et al., J Immunol, Vol. 163(7):4058-63 (1999)) epitopes have been shown to be 
active. It has also been shown that a multi-epitope DNA plasmid encoding nine different 
HLA-A2.1- and Al 1 -restricted epitopes derived from HBV and HIV induced CTL against 
all epitopes (Ishioka et al. J Immunol, Vol. 162(7):3915-25 (1999)). 

Thus, minigene vaccines containing multiple MHC Class I and Class II (Le. 9 CTL) 
epitopes can be designed, and presentation and recognition can be obtained for all epitopes. 
However, the immunogenicity of multi-epitope constructs appears to be strongly influenced 
by a number of variables, a number of which have heretofore been unknown. For example, 
the immunogenicity (or antigenicity) of the same epitope expressed in the context of 
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need to identify strategies to optimize multi-epitope vaccine constructs. Such optimization 
is important in terms of induction of potent immune responses and ultimately, for clinical 
efficacy. Accordingly, the present invention provides strategies to optimize antigenicity 
and immunogenicity of multi-epitope vaccines encompassing a large number of epitopes, 
and optimized multi-epitope vaccines, particularly minigene vaccines, generated in 
accordance with these strategies. 

The following paragraphs provide a brief review of some of the main variables 
potentially influencing minigene immunogenicity, epitope processing, and presentation on 
antigen presenting cells (APCs) in association with Class I and Class II MHC molecules. 

Immunodbminance 

Of the many thousand possible peptides that are encoded by a complex foreign 
pathogen, only a small fraction ends up in a peptide form capable of binding to MHC Class 
I antigens and thus of being recognized by T cells. This phenomenon, of obvious potential 
impact on the development of a multi-epitope vaccine, is known as immunodominance 
(Yewdell et al., Anna Rev Immunol, 17:51-88 (1999)). Several major variables contribute 
to immunodominance. Herein, we describe variables affecting the generation of the 
appropriate peptides, both in qualitative and quantitative terms, as a result of intracellular 
processing. 

Junctional Epitopes 

A junctional epitope is defined as an epitope created due to the juxtaposition of two 
other epitopes. The new epitope is composed of a C-tenninal section derived from a first : 
epitope, and an N-terminal section derived from a second epitope. Creation of junctional 
epitopes is a potential problem in the design of multi-epitope minigene vaccines, for both 
Class I and Class II restricted epitopes for the following reasons. Firstly, when developing 
a minigene composed of, or containing, human epitopes, which are typically tested for 
immunogenicity in HLA transgenic laboratory animals, the creation of murine epitopes 
could create undesired immunodominance effects. Secondly, the creation of new, 
unintended epitopes for human HLA Class I or Class II molecules could elicit in vaccine 
recipients, new T cell specificities that are not expressed by infected cells or tumors that are 
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the targets-induced T cell responses. These responses are by definition irrelevant and 
ineffective and could even be counterproductive, by creating undesired immunodominance 
effects. 

The existence of junctional epitopes has been documented in a variety of different 
5 experimental situations. Gefter and collaborators first demonstrated the effect in a system 
in which two different Class II restricted epitopes were juxtaposed and colinearly 
synthesized (Perkins et al, J Immunol, Vol. 146(7):2137-44 (1991)). The effect was so 
marked that the immune system recognition of the epitopes could be completely "silenced" 
by these new junctional epitopes (Wang et aL, Cell Immunol, Vol. 143(2):284-97 (1992)). 

10 Helper T cells directed against junctional epitopes were also observed in humans as a result 
of immunization with a synthetic lipopeptide, which was composed of an HLA-A2- 
restricted HB V-derived immunodominant CTL epitope, and a universal Tetanus Toxoid- 
derived HTL epitope (Livingston et al, J Immunol, Vol. 159(3):1383-92 (1997)). Thus, the 
creation of junctional epitopes are a major consideration in the design of multi-epitope 

15 constructs. 

The present invention provides methods of addressing this problem and avoiding or 
minimizing the occurrence of junctional epitopes. 

Flanking regions - 1 

Class I restricted epitopes are generated by a complex process (Y ewdell et al., Annu 
20 Rev Immunol, 17:51-88 (1999)). Limited proteolysis involving endoproteases and potential 
trimming by exoproteases is followed by translocation across the endoplasmic reticulum 
(ER) membrane by transporters associated with antigen processing (TAP) molecules. The 
major cytosolic protease complex involved in generation of antigenic peptides, and their 
precursors, is the proteosome (Niedermann et aL, Immunity, Vol. 2(3):289-99 (1995)), 
25 although ER trimming of CTL precursors has also been demonstrated (Paz et al., Immunity 
Vol. 1 1(2):241-51 (1999)). It has long been debated whether or not the residues 
immediately flanking the C and N terminus of the epitope, have an influence on the 
efficiency of epitope generation. 

The yield and availability of processed epitope has been implicated as a major 
30 variable in determining immunogenicity and could thus clearly have a major impact on 
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overall minigene potency in that the magnitude of immune response can be directly 
proportional to the amount of epitope bound by MHC and displayed for T cell recognition. 
Several studies have provided evidence that this is indeed the case. For example, induction 
of virus-specific CTL that is essentially proportional to epitope density (Wherry et al., / 
5 Immunol, Vol. 163(7):3735-45 (1999)) has been observed. Further, recombinant 

minigenes, which encode a preprocessed optimal epitope, have been used to induce higher 
levels of epitope expression than naturally observed with full-length protein (Anton et al., / 
Immunol, Vol. 158(6):2535-42 (1997)). In general, minigene priming has been shown to 
. be more effective than priming with the whole antigen (Restifo et al., J Immunol, Vol. 
10 154(9):4414-22 (1995); Ishioka et al., J Immunol, Vol. 162(7):3915-25 (1999)), even 

though some exceptions have been noted (Iwasaki et al., Vaccine, Vol. 17(15-16):2081-8 
(1999)). 

Early studies concluded that residues within the epitope (Hahn et al., J Exp Med, 
Vol. 176(5): 1335-41 (1992)) primarily regulate immunogenicity. Similar conclusions were 

15 reached by other studies, mostly based on grafting an epitope in an unrelated gene, or in the 
same gene, but in a different location (Chimini et al., J Exp Med, Vol. 169(l):297-302 
(1989); Hahn et al., J Exp Med, Vol. 174(3):733-6 (1991)). Other experiments however 
(DelVal et al., Cell, Vol. 66(6):1 145-53 (1991); Hahn et al, J Exp Med, Vol. 176(5):1335- 
41 (1992)), suggested that residues localized directly adjacent to the CTL epitope can 

20 directly influence recognition (Couillin et al., J Exp Med, Vol. 180(3): 1 129-34 (1994); 

Bergmann et al., J Virol Vol. 68(8):5306-10 (1994)). In the context of minigene vaccines, 
the controversy has been renewed. Shastri and coworkers (Shastri et al., J Immunol, Vol. 
155(9):4339-46 (1995)) found that T cell responses were not significantly affected by 
varying the N-terminal flanking residue but were inhibited by the addition of a single C- 

25 terminal flanking residue. The most dramatic inhibition was observed with isoleucine, 
leucine, cysteine, and proline as the C-terminal flanking residues. In contrast, Gileadi 
(Gileadi et al., Eur J Immunol, Vol. 29(7):22 13-22 (1999)) reported profound effects as a 
function of the residues located at the N terminus of mouse influenza virus epitopes. 
Bergmann and coworkers found that aromatic, basic and alanine residues supported 

30 efficient epitope recognition, while G and P residues were strongly inhibitory (Bergmann et 
al., J Immunol, Vol. 157(8):3242-9 (1996)). In contrast, Lippolis (Lippolis et al., / Virol, 
Vol. 69(5):3 134-46 (1995)) concluded that substituting flanking residues did not effect 
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recognition. However, only rather conservative substitutions which are unlikely to affect 
proteosome specificity, were tested. 

It appears that the specificity of these effects, and in general of natural epitopes, 
roughly correlates with proteosome specificity. For example, proteosome specificity is = 
partly trypsin-like (Niedermann et al., Immunity, Vol 2(3):289-99 (1995)), with cleavage 
following basic amino acids. Nevertheless, efficient cleavage of the carboxyl side of 
hydrophobic and acidic residues is also possible. Consistent with these specificities are the 
studies of Sherman and collaborators, which found that an R to H mutation at the position 
following the C-tenninus of a p53 epitope affects proteosome-mediated processing of the 
protein (Theobald et al., J Exp Med, Vol. 188(6): 101 7-28 (1998)). Several other studies 
(Hanke et al., J Gen Virol Vol. 79 ( Pt l):83-90 (1998); Thomson et al., Proc Natl Acad Sci 
U SA, Vol. 92(13):5845-9 (1995)) indicated that minigenes can be constructed utilizing 
minimal epitopes, and that these flanking sequences appear not be required, although the 
potential for further optimization by the use of flanking regions was also acknowledged. 

In sum, for HLA Class I epitopes, the effects of flanking regions on processing and 
presentation of CTL epitopes is as yet undefined. A systematic analysis of the effect of 
modulation of flanking regions has not been performed for minigene vaccines. Thus, 
analysis utilizing minigene vaccines encoding epitopes restricted by human Class I in 
general is needed. The present invention provides such an analysis and accordingly, 
provides multi-epitope vaccine constructs optimized for immunogenicity and antigenicity, 
and methods of designing such constructs. 

HLA Class II peptide complexes are also generated as a result of a complex series 
of events that is distinct from HLA Class I processing. The processing pathway involves 
association with Invariant chain (Ii), its transport to specialized compartments, the 
degradation of Ii to CLIP, and HLA-DM catalyzed removal of CLIP (see (Blum et al., Crit 
Rev Immunol Vol. 17(5-6):411-7 (1997); Arndt et al, Immunol Res, Vol. 16(3):261-72 
(1997)) for review. Moreover, there is a potentially crucial role of various cathepsins in 
general, and cathepsin S and L in particular, in Ii degradation (Nakagawa et al. Immunity, 
Vol. 10(2):207-17 (1999)). In terms of generation of functional epitopes however, the 
process appears to be somewhat less selective (Chapman H.A, Curr Opin Immunol, 
Vol. 10(1):93-102 (1998)), and peptides of many sizes can bind to MHC Class IMHC Class 
H (Hunt et al., Science, Vol. 256(5065): 1817-20 (1992)). Most or all of the possible 
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peptides appear to be generated (Moudgil et al., J Immunol Vol. 159(6):2574-9 (1997); and 
Thomson et ah, J Virol Vol. 72(3):2246-52 (1998)). Thus, as compared to the issue of 
flanking regions, the creation of junctional epitopes can be a more serious concern in 
particular embodiments. 

5 

SUMMARY 

The invention provides a method and system for optimizing the efficacy of multi- 
epitope vaccines so as to minimize the number of junctional epitopes and maximize, or at 
least increase, the immunogenicity and/or antigenicity of multi-epitope vaccines. In 

10 particular, the present invention provides multi-epitope nucleic acid constructs encoding a 
plurality of CTL and/or HTL epitopes. 

In one embodiment of the invention, a computerized method for designing a multi- 
epitope construct having multiple epitopes includes the steps of: storing a plurality of input 
parameters in a memory of a computer system, the input parameters including a plurality of 

15 epitopes, at least one motif for identifying junctional epitopes, a plurality of amino acid 

insertions and at least one enhancement weight value for each insertion; generating a list of 
epitope pairs from the plurality of epitopes; determining for each epitope pair at least one 
optimum combination of amino acid insertions based on the at least one motif, the plurality 
of insertions and the at least one enhancement weight value for each insertion; and 

20 identifying at least one optimum arrangement of the plurality of epitopes, wherein a 

respective one of the at least one optimum combination of amino acid insertions is inserted 
at a respective junction of two epitopes, so as to provide an optimized multi-epitope 
construct. In a preferred embodiment, the step of identifying at least one optimum 
arrangement of epitopes may be accomplished by perfoiming either an exhaustive search 

25 wherein all permutations of arrangements of the plurality of epitopes are evaluated or a 

stochastic search wherein only a subset of all permutations of arrangements of the plurality 
of epitopes are evaluated. 

In a further embodiment, the method determines for each epitope pair at least one 
optimum combination of amino acid insertions by calculating a function value (F) for each 

30 possible combination of insertions for each epitope pair, wherein the number of insertions 
in a combination may range from 0 to a maximum number of insertions (Maxlnsertions) 
value input by a user, and the function value is calculated in accordance with the equation F 

6 
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= (C+N)/J, when J > 0, and F = 2(C+N), when J = 0, wherein C equals the enhancement 
weight value of a C+l flanking amino acid, N equals the enhancement weight value of ah 
N-l flanking amino acid, and J equals the number of junctional epitopes detected for each* 
respective combination of insertions in an epitope pair based on said at least one motif. 
5 In another embodiment of the invention, a computer system for designing a multi- 

epitope construct having multiple epitopes, includes: a memory for storing a plurality of. 
input parameters such as a plurality of epitopes, at least one motif for identifying junctional 
epitopes, a plurality of amino acid insertions and at least one enhancement weight value for . 
each insertion; a processor for retrieving the input parameters from memory and generating 

1 0 a list of epitope pairs from the plurality of epitopes; wherein the processor further 
determines for each epitope pair at least one optimum combination of amino acid 
insertions, based on the at least one motif, the plurality of insertions and the at least one 
enhancement weight value for each insertion. The processor further identifies at least one 
optimum arrangement of the plurality of epitopes, wherein a respective one of the optimum 

15 combinations of amino acid insertions are inserted at a respective junction of two epitopes, • 
to provide an optimized multi-epitope construct; and a display monitor, coupled to the 
processor, for displaying at least one optimum arrangement of the plurality of epitopes to a 
user. 

In a further embodiment, the invention provides a data storage device storing a 
20 computer program for designing a multi-epitope construct having multiple epitopes, the 
computer program, when executed by a computer system, performing a process that, 
includes the steps of: retrieving a plurality of input parameters from a memory of a - 
computer system, the input parameters including, for example, a plurality of epitopes, at = 
least one motif for identifying junctional epitopes, a plurality of amino acid insertions and 
25 at least one enhancement weight value for each insertion; generating a list of epitope pairs 
from the plurality of epitopes; determining for each epitope pair at least one optimum 
combination of amino acid insertions based on the at least one motif, the plurality of 
insertions and the at least one enhancement weight value for each insertion; and identifying 
at least one optimum arrangement of the plurality of epitopes, wherein a respective one of 
30 the at least one optimum combination of amino acid insertions is inserted at a respective 
junction of two epitopes, so as to provide an optimized multi-epitope construct. 
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In another embodiment, the invention provides a method and system for designing a 
multi-epitope construct that comprises multiple epitopes. The method comprising steps of: 

(i) sorting the multiple epitopes to minimize the number of junctional epitopes; 

(ii) introducing a flanking amino acid residue at a C+l position of an epitope to be included 
5 within the multi-epitope construct; (iii) introducing one or more amino acid spacer residues 

between two epitopes of the multi-epitope construct, wherein the spacer prevents the 
occurrence of a junctional epitope; and, (iv) selecting one or more multi-epitope constructs ■. 
that have a minimal number of junctional epitopes, a minimal number of amino acid spacer 
residues, and a maximum number of flanking amino acid residues at a C+l position relative 

10 to each epitope. In some embodiments,, the spacer residues are independently selected from 
residues that are not known HLA Class II primary anchor residues. In particular 
embodiments, introducing the spacer residues prevents the occurrence of an HTL epitope; 
Such a spacer often comprises at least 5 amino acid residues independently selected from 
the group consisting of G, P, and N. In some embodiments the spacer is GPGPG. 

15 In some embodiments, introducing the spacer residues prevents the occurrence of a 

CTL epitope and further, wherein the spacer is 1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues 
independently selected from the group consisting of A and G. Often, the flanking residue 
is introduced at the C+l position of a CTL epitope and is selected from the group 
consisting of K, R, N, G, and A. In some embodiments, the flanking residue is adjacent to 

20 the spacer sequence. The method of the invention can also include substituting an N- 

terminal residue of an epitope that is adjacent to a C-terminus of an adjacent epitope within 
the multi-epitope construct with a residue selected from the group consisting of K, R, N, G, 
and A. 

In some embodiments, the method of the invention can also comprise a step of 
25 predicting a structure of the multi-epitope construct, and further, selecting one or more 
constructs that have a maximal structure, L e. , that are processed by an HLA processing 
pathway to produce all of the epitopes comprised by the construct. In some embodiments, 
the multi-epitope construct encodes EP-HTV-1090 as set out in Figure 9, HTV-CPT as set 
out in Figure 9, or HTV-TC as set out in Figure 9. 
30 In another embodiment of the invention, a system for optimizing multi-epitope 

constructs include a computer system having a processor (e.g., central processing unit) and 
at least one memory coupled to the processor for storing instructions executed by the 
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processor and data to be manipulated (i.e., processed) by the processor. The computer 
system further includes an input device (e.g., keyboard) coupled to the processor and the at 
least one memory for allowing a user to input desired parameters and information to be 
accessed by the processor. The processor may be a single CPU or a plurality of different 
5 processing devices/circuits integrated onto a single integrated circuit chip. Alternatively, 
the processor may be a collection of discrete processing devices/circuits selectively coupled 
to one another via either direct wire/conductor connections or via a data bus. Similarly, the 
at least one memory may be one large memory device (e.g., EPROM), or a collection of a 
plurality of discrete memory devices (e.g., EEPROM, EPROM, RAM, DRAM, SDRAM, 

10 Flash, etc.) selectively coupled to one another for selectively storing data and/or program 
information (i.e., instructions executed by the processor). Those of ordinary skill in the art 
would easily be able to implement a desired computer system architecture to perform the 
operations and functions disclosed herein. 

In one embodiment, the computer system includes a display monitor for displaying 

15 information, instructions, images, graphics, etc. The computer system receives user inputs 
via a keyboard. These user input parameters may include, for example, the number of 
insertions (i.e., flanking residues and spacer residues), the peptides to be processed, the . 
C+l and N-l weighting values for each amino acid, and the motifs to use for searching for 
junctional epitopes. Based on these input values/parameters, the computer system executes 

20 a "Junctional Analyzer" software program which automatically determines the number of 
junctional epitope for each peptide pair and also calculates an "enhancement" value for 
each combination of flanking residues and spacers that may be inserted at the junction of - 
each peptide pair. The results of the junctional analyzer program are then used in either an 
exhaustive or stochastic search program which determines the "optimal" combination or linkage of 

25 the entire set of peptides to create a multi-epitope polypeptide, or nucleic acids, having a minimal 
number of junctional epitopes and a maximum functional (e.g., immunogenicity) value. 

In one embodiment, if the number of peptides to be processed by the computer 
system is less than fourteen, an exhaustive search program is executed by the computer 
system which examines all permutations of the peptides making up the polypeptide to find 

30 the permutation with the "best" or "optimal" function value. In one embodiment, the 

function value is calculated using the equation (Ce + Ne)/J when J is greater than zero and 
2 * (Ce + Ne) when J is equal to zero, where Ce is the enhancement "weight" value of an 
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amino acid at the C+l position of a peptide, Ne is the enhancement "weight" value of an 
amino acid at the N-l position of a peptide, and J is the number of junctional epitopes 
contained in the polypeptide encoded by multi-epitope nucleic acid sequence. Thus, 
maximizing this function value will identify the peptide pairs having the least number of 
5 junctional epitopes and the maximum enhancement weight value for flanking residues. If 
the number of peptides to be processed is fourteen or more, the computer system executes a 
stochastic search program that uses a "Monte Carlo" technique to examine many regions of 
the permutation space to find the best estimate of the optimum arrangement of peptides 
(e.g., having the maximum function value). '■ 

10 In a further embodiment, the computer system allows a user to input parameter 

values which format or limit the output results of the exhaustive or stochastic search 
program. For example, a user may input the maximum number of results having the same 
function value ('MaxDuplicateFunctionValue = X") to limit the number of permutations 
that are generated as a result of the search. Since it is possible for the search programs to 

15 find many arrangements that give the same function value, it may be desirable to prevent 
the output file from being filled by a large number of equivalent solutions. Once this limit 
is reached no more results are reported until a larger or "better" function value is found. As 
another example, the user may input the maximum number of "hits" per probe during a 
stochastic search process. This parameter prevents the stochastic search program from 

20 generating too much output on a single probe. In a preferred embodiment, the number of 
permutations examined in a single probe is limited by several factors: the amount of time 
set for each probe in the input text file; the speed of the computer, and the values of the < 
parameters "MaxHitsPerProbe" and cc MaxDuplicateFunctionValues." The algorithms used 
to generate and select permutations for analysis may be in accordance with well-known 

25 recursive algorithms found in many computer science text books. For example, six 
permutations of three things taken three at a time would be generated in the following 
sequence: ABC; ACB; BAC; BCA; CBA; CAB. As a further example of an input 
parameter, a user may input how the stochastic search is performed, e.g., randomly, 
statistically or other methodology; the maximum time allowed for each probe (e.g., 5 

30 minutes); and the number of probes to perform. 

Also disclosed herein are multi-epitope constructs designed by the methods 
described above and hereafter. The multi-epitope constructs include spacer nucleic acids 

10 
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between a subset of the epitope nucleic acids or all of the epitope nucleic acids. One or 
more of the spacer nucleic acids may encode amino acid sequences different from amino 
acid sequences encoded by other spacer nucleic acids to optimize epitope processing and to 
minimize the presence of junctional epitopes. 

5 ■ 

DEFINITIONS 

The following definitions are provided to enable one of ordinary skill in the art to : 
understand some of the preferred embodiments of invention disclosed herein. It is 
understood, however, that these definitions are exemplary only and should not be used to 

10 limit the scope of the invention as set forth in the claims. Those of ordinary skill in the art 
will be able to construct slight modifications to the definitions below and utilize such 
modified definitions to understand and practice the invention disclosed herein. Such 
modifications, which would be obvious to one of ordinary skill in the art, as they may be 
applicable to the claims set forth below, are considered to be within the scope of the present 

15 invention. 

Throughout this disclosure, "binding data" results are often expressed in terms of 
"ICso's." IC50 is the concentration of peptide in a binding assay at which 50% inhibition of 
binding of a reference peptide is observed. Given the conditions in which the assays are 
run limiting HLA proteins and labeled peptide concentrations), these values 

20 approximate Kd values. Assays for determining binding are described in detail, e.g., in 

PCT publications WO 94/20127 and WO 94/03205. It should be noted that IC50 values can 
. change, often dramatically, if the assay conditions are varied, and depending on the 
particular reagents used (eg., HLA preparation, etc.). For example, excessive 
concentrations of HLA molecules will increase the apparent measured IC50 of a given 

25 ligand. Alternatively, binding is expressed relative to a reference peptide. Although as a 
particular assay becomes more, or less, sensitive, the ICso's of the peptides tested may « 
change somewhat, the binding relative to the reference peptide will not significantly 
change. For example, in an assay run under conditions such that the IC50 of the reference 
peptide increases 10-fold, the IC50 values of the test peptides will also shift approximately 

30 10-fold. Therefore, to avoid ambiguities, the assessment of whether a peptide is a good, 
intermediate, weak, or negative binder is generally based on its IC50, relative to the IC50 of 
a standard peptide. Binding may also be determined using other assay systems including 
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those using: live cells (e.g., Ceppellini et al, Nature 339:392, 1989; Christnick et aU 
Nature 352:67, 1991; Busch et al, Int. Immunol 2:443, 19990; Hill et al, J. Immunol 
147:189, 1991; del Guercio et al, J. Immunol 154:685, 1995), cell free systems using 
detergent lysates (e.g., Cerundolo et al, J. Immunol 21:2069, 1991), immobilized purified 
5 MHC (e.g., Hill et al, J. Immunol 152, 2890, 1994; Marshall et al, J. Immunol 
152:4946, 1994), ELISA systems (e.g., Reay et al, EMBO J. 11:2829, 1992), surface 
plasmon resonance (e.g, Khilko et al, J. Biol Chem. 268: 15425, 1993); high flux 
soluble phase assays (Hammer et al, J. Exp. Med. 180:2353, 1994), and measurement of 
class I MHC stabilization or assembly (e.g., Ljunggren et al, Nature 346:476, 1990; 

10 Schumacher et al, Cell 62:563, 1990; Townsend et al, Cell 62:285, 1990; Parker et al, J. 
Immunol 149:1896, 1992). 

The designation of a residue position in an epitope as the "carboxyl terminus" or the : 
"carboxyl terminal position" refers to the residue position at the end of the epitope that is 
nearest to the carboxyl terminus of a peptide, which is designated using conventional 

1 5 nomenclature as defined below. "C + 1" refers to the residue or position immediately 
following the C-terminal residue of the epitope, i.e., refers to the residue flanking the C- 
terminus of the epitope. The "carboxyl terminal position" of the epitope occurring at the 
carboxyl end of the multi-epitope construct may or may not actually correspond to the 
carboxyl terminal end of polypeptide. In preferred embodiments, the epitopes employed in 

20 the optimized multi-epitope constructs are motif-bearing epitopes and the carboxyl 

terminus of the epitope is defined with respect to primary anchor residues corresponding to 
a particular motif. 

The designation of a residue position in an epitope as "amino terminus" or "amino- 
terminal position" refers to the residue position at the end of the epitope which is nearest to 

25 the amino terminus of a peptide, which is designated using conventional nomenclature as 
defined below. "N-l" refers to the residue or position immediately adjacent to the epitope 
at the amino terminal end (position number 1) of an eptiope. The "amino terminal 
position" of the epitope occurring at the amino terminal end of the multi-epitope construct 
may or may not actually corresponds to the amino terminal end of the polypeptide. In 

30 preferred embodiments, the epitopes employed in the optimized multi-epitope constructs 
are motif-bearing epitopes and the amino terminus of the epitope is defined with respect to 
primary anchor residues corresponding to a particular motif. 
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A "computer" or "computer system" generally includes: a processor; at least one 
information storage/retrieval apparatus such as, for example, a hard drive, a disk drive or a . 
tape drive; at least one input apparatus such as, for example, a keyboard, a mouse, a touch 
screen, or a microphone; and display structure. Additionally, the computer may include a 
5 communication channel in communication with a network such that remote users may 
communicate with the computer via the network to perform multi-epitope construct 
optimization functions disclosed herein. Such a computer may include more or less than 
what is listed above. The network may be a local area network (LAN), wide area network : 1 
(WAN) or a global network such as the world wide web (e.g., the internet). 

10 A "construct" as used herein generally denotes a composition that does not occur in 

nature. A construct can be produced by synthetic technologies, e.g. , recombinant DNA 
preparation and expression or chemical synthetic techniques for nucleic or amino acids. A 
construct can also be produced by the addition or affiliation of one material with another 
such that the result is not found in nature in that form. A "multi-epitope construct" can be 

1 5 used interchangeably with the term "minigene" or "multi-epitope nucleic acid vaccine," 
and comprises multiple epitope nucleic acids that encode peptide epitopes of any length 
that can bind to a molecule functioning in the immune system, preferably a class I HLA 
and a T-cell receptor or a class II HLA and a T-cell receptor. All of the epitope nucleic 
acids in a multi-epitope construct can encode class I HLA epitopes or class II HLA 

20 epitopes. Class I HLA-encoding epitope nucleic acids are referred to as CTL epitope 
nucleic acids, and class II HLA-encoding epitope nucleic acids are referred to as HTL 
epitope nucleic acids. Some multi-epitope constructs can have a subset of the multi-epitope 
nucleic acids encoding class I HLA epitopes and another subset of the multi-epitope • 
nucleic acids encoding class II HLA epitopes. The CTL epitope nucleic acids preferably 

25 encode an epitope peptide of about eight to about thirteen amino acids in length, more 
preferably about eight to about eleven amino acids in length, and most preferably about 
nine amino acids in length. The HTL epitope nucleic acids can encode an epitope peptide 
of about seven to about twenty three, preferably about seven to about seventeen, more 
preferably about eleven to about fifteen, and most preferably about thirteen amino acids in 

30 length. The multi-epitope constructs described herein preferably include five or more, ten 
or more, fifteen or more, twenty or more, or twenty-five or more epitope nucleic acids. All 
of the epitope nucleic acids in a multi-epitope construct may be from one organism (e.g., 
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the nucleotide sequence of every epitope nucleic acid may be present in HIV strains), or the 
multi-epitope construct may include epitope nucleic acids present in two or more different 
organisms (e.g., some epitopes from HIV and some from HCV). The term "EpiGene™" is 
used herein to refer to certain multi-epitope constructs. As described hereafter, one or more 
5 epitope nucleic acids in the multi-epitope construct may be flanked by a spacer nucleic 
acid. 

A "multi-epitope vaccine," which is synonyous with a "polyepitopic vaccine," is a 
vaccine comprising multiple epitopes. 

"Cross-reactive binding" indicates that a peptide is bound by more than one HLA 
10 molecule; a synonym is "degenerate binding." 

A "cryptic epitope" elicits a response by immunization with an isolated peptide, but 
the response is not cross-reactive in vitro when intact whole protein that comprises the 
epitope is used as an antigen. 

A "dominant epitope" is an epitope that induces an immune response upon 
1 5 immunization with a whole native antigen (see, e.g. , Sercarz, et al. , Annu. Rev. Immunol. 
1 1 :729-766, 1993). Such a response is cross-reactive in vitro with an isolated peptide 
epitope. 

With regard to a particular amino acid sequence, an "epitope" is a set of amino acid 
residues which is involved in recognition by a particular immunoglobulin, or in the context 
20 of T cells, those residues necessary for recognition by T cell receptor proteins and/or Major 
Histocompatibility Complex (MHC) receptors. In an immune system setting, in vitro or in 
vivo, an epitope is the collective features of a molecule, such as primary, secondary and 
tertiary peptide structure, and charge, that together form a site recognized by an 
immunoglobulin, T cell receptor or HLA molecule. Throughout this disclosure epitope and 
25 peptide are often used interchangeably. It is to be appreciated, however, that isolated or 
purified protein or peptide molecules larger than and comprising an epitope of the 
invention are still within the bounds of the invention. 

A "flanking residue" is a residue that is positioned next to an epitope. , A flanking 
residue can be introduced or inserted at a position adjacent to the N-terminus or the C- 
30 terminus of an epitope. 

An "immunogenic peptide" or "peptide epitope" is a peptide that comprises an 
allele-specific motif or supennotif such that the peptide will bind an HLA molecule and 
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induce a CTL and/or HTL response. Thus, immunogenic peptides of the invention are 
capable of binding to an appropriate HLA molecule and thereafter inducing a cytotoxic T 
cell response, or a helper T cell response, to the antigen from which the immunogenic 
peptide is derived. 

5 "Heteroclitic analogs" are defined herein as a peptide with increased potency for a 

specific T cell, as measured by increased responses to a given dose, or by a requirement of 
lesser amounts to achieve the same response. Advantages of heteroclitic analogs include 
that the epitopes can be more potent, or more economical (since a lower amount is required 
to achieve the same effect). In addition, modified epitopes might overcome antigen- 

10 specific T cell unresponsiveness (T cell tolerance). 

"Human Leukocyte Antigen" or "HLA" is a human class I or class II Major 
Histocompatibility Complex (MHC) protein {see, e.g., Stites, et al, IMMUNOLOGY, 8 th Ed., 
Lange Publishing, Los Altos, CA (1994)). 

An "HLA supertype or HLA family," as used herein, describes sets of HLA 

15 molecules grouped based on shared peptide-binding specificities. HLA class I molecules 
that share similar binding affinity for peptides bearing certain amino acid motifs are 
grouped into such HLA supertypes. The terms HLA superfamily, HLA supertype family, 
HLA family, and HLA xx-like molecules (where xx denotes a particular HLA type), are 
synonyms. 

20 As used herein, "high affinity" with respect to HLA class I molecules is defined as 

binding with an IC50, or K D value, of 50 nM or less; "intermediate affinity" with respect to 
HLA class I molecules is defined as binding with an IC50 or Kd value of between about 50 
and about 500 nM. tc High affinity" with respect to binding to HLA class II molecules is 
defined as binding with an IC50 or K D value of 100 nM or less; "intermediate affinity" with 
25 respect to binding to HLA class II molecules is defined as binding with an IC50 or Kd value •■ '■ 
of between about 100 and about 1000 nM. 

An "IC50" is the concentration of peptide in a binding assay at which 50% inhibition 
of binding of a reference peptide is observed. Depending on the conditions in which the 
assays are run (z.e., limiting HLA proteins and labeled peptide concentrations), these values 
30 may approximate Kd values. 

The terms "identical" or percent "identity," in the context of two or more peptide 
sequences, refer to two or more sequences or subsequences that are the same or have a 
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specified percentage of amino acid residues that are the same, when compared and aligned 
for maximum correspondence over a comparison window, as measured using a sequence 
comparison algorithm or by manual alignment and visual inspection. 

"Introducing" an amino acid residue at a particular position in a multi-epitope 
5 construct, e.g., adjacent, at the C-teiminal side, to the C-terminus of the epitope, 

encompasses configuring multiple epitopes such that a desired residue is at a particular 
position, e.g., adjacent to the epitope, or such that a deleterious residue is not adjacent ta- 
ttle C-terminus of the epitope. The term also includes inserting an amino acid residue, 
preferably a preferred or intermediate amino acid residue, at a particular position. An 
10 amino acid residue can also be introduced into a sequence by substituting one amino acid 
residue for another. Preferably, such a substitution is made in accordance with analoging 
principles set forth, e.g., in co-pending U.S.S.N. 09/260,714 filed 3/1/99 and PCT 
application number PCT/US00/19774. 

The phrases "isolated" or "biologically pure" refer to material that is substantially 
15 or essentially free from components which normally accompany the material as it is found 
in its native state. Thus, isolated peptides in accordance with the invention preferably do 
not contain materials normally associated with the peptides in their in situ environment. 

"Link" or "join" refers to any method known in the art for functionally connecting 
peptides, including, without limitation, recombinant fusion, covalent bonding, disulfide 
20 bonding, ionic bonding, hydrogen bonding, and electrostatic bonding. 

"Major Histocompatibility Complex" or "MHC" is a cluster of genes that plays a 
role in control of the cellular interactions responsible for physiologic immune responses. In 1 
humans, the MHC complex is also known as the HLA complex. For a detailed description 
of the MHC and HLA complexes, see, Paul, FUNDAMENTAL IMMUNOLOGY, 3 rd Ed., Raven 
25 Press, New York, 1993. 

As used herein, "middle of the peptide" is a position in a peptide that is neither an 
amino or a carboxyl terminus. 

A "minimal number of junctional epitopes" as used herein refers to a number of 
junctional epitopes that is lower than what would be created using a random selection 
30 criteria. 

The term "motif refers to the pattern of residues in a peptide of defined length, 
usually a peptide of from about 8 to about 1 3 amino acids for a class I HLA motif and from 
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about 6 to about 25 amino acids for a class II HLA motif, which is recognized by a 
particular HLA molecule. Peptide motifs are typically different for each protein encoded 
by each human HLA allele and differ in the pattern of the primary and secondary anchor 
residues. 

5 A "negative binding residue" or "deleterious residue" is an amino acid which, if 

present at certain positions (typically not primary anchor positions) in a peptide epitope, 
results in decreased binding affinity of the peptide for the peptide's corresponding HLA 
molecule. 

"Optimizing" refers to increasing the immunogenicity or antigenicity of a multi- 

10 epitope construct having at least one epitope pair by sorting epitopes to minimize the 

occurrence of junctional epitopes, inserting flanking residues that flank the C-terminus or 
N-tenninus of an epitope, and inserting spacer residue to further prevent the occurrence of 
junctional epitopes or to provide a flanking residue. An increase in immunogenicity or 
antigenicity of an optimized multi-epitope construct is measured relative to a multi-epitope 

15 construct that has not been constructed based on the optimization parameters and is using 
assays known to those of skill in the art, e.g., assessment of immunogenicity in HLA 
transgenic mice, ELISPOT, inteferon-gamma release assays, tetramer staining, chromium 
release assays, and presentation on dendritic cells. 

The term "peptide" is used interchangeably with "oligopeptide" in the present 

20 specification to designate a series of residues, typically L-amino acids, connected one to the 
other, typically by peptide bonds between the cc-amino and carboxyl groups of adjacent 
amino acids. The preferred CTL-inducing peptides of the invention are 13 residues or less 
in length and usually consist of between about 8 and about 1 1 residues, preferably 9 or 10 
residues. The preferred HTL-inducing oligopeptides are less than about 50 residues in 

25 length and usually consist of between about 6 and about 30 residues, more usually between 
about 12 and 25, and often between about 15 and 20 residues. 

A 'TanDR binding peptide or PADRE® peptide" is a member of a family of 
molecules that binds more than one HLA class II DR molecule. The pattern that defines 
the PADRE® family of molecules can be thought of as an HLA Class II supermotif. 

30 PADRE® binds to most HLA-DR molecules and stimulates in vitro and in vivo human 
helper T lymphocyte (HTL) responses. 
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"Pharmaceutically acceptable" refers to a generally non-toxic, inert, and/or 
physiologically compatible composition. 

"Presented to an HLA Class I processing pathway" means that the multi-epitope 
constructs are introduced into a cell such that they are largely processed by an HLA Class I 
5 processing pathway. Typically, multi-epitope constructs are introduced into the cells using 
expression vectors that encode the multi-epitope constructs. HLA Class II epitopes that are 
encoded by such a multi-epitope construct are also presented on Class II molecules, 
although the mechanism of entry of the epitopes into the Class II processing pathway is not . ' 
defined. 

10 A "primary anchor residue" or a "primary MHC anchor" is an amino acid at a 

specific position along a peptide sequence that is understood to provide a contact point 
between the immunogenic peptide and the HLA molecule. One to three, usually two, 
primary anchor residues within a peptide of defined length generally defines a "motif for 
an immunogenic peptide. These residues are understood to fit in close contact with peptide 

1 5 binding grooves of an HLA molecule, with their side chains buried in specific pockets of 
the binding grooves themselves. In one embodiment, for example, the primary anchor 
residues of an HLA class I epitope are located at position 2 (from the amino terminal 
position) and at the carboxyl terminal position of a 9-residue peptide epitope in accordance • 
with the invention. The primary anchor positions for each motif and supermotif are 

20 described, for example, in Tables I and IE of PCT/US00/27766, or PCT/US00/ 19774. 

Preferred amino acids that can serve as in the anchors for most Class II epitopes consist of 
M and F in position one and V, M, S, T, A and C in position six. Tolerated amino acids 
that can occupy these positions for most Class II epitopes consist of L, I, V, W, and Y in 
position one and P, L and I in position six. The presence of these amino acids in positions 

25 one and six in Class II epitopes defines the HLA-DR1, 4, 7 supermotif. The HLA-DR3 • 
binding motif is defined by preferred amino acids from the group of L, I, V, M, F, Y and A 
in position one and D, E, N, Q, S and T in position four and K, R and H in position six. 
Other amino acids may be tolerated in these positions but they are not preferred. 

Furthermore, analog peptides can be created by altering the presence or absence of 

30 particular residues in these primary anchor positions. Such analogs are used to modulate 
the binding affinity of a peptide comprising a particular motif or supermotif. 
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"Promiscuous recognition" occurs where a distinct peptide is recognized by the 
same T cell clone in the context of various HLA molecules. Promiscuous recognition or 
binding is synonymous with cross-reactive binding. 

A "protective immune response" or "therapeutic immune response" refers to a CTL 
5 and/or an HTL response to an antigen derived from an infectious agent or a tumor antigen, 
which in some way prevents or at least partially arrests disease symptoms, side effects or 
progression. The immune response may also include an antibody response that has been 
facilitated by the stimulation of helper T cells. 

The term "residue" refers to an amino acid or amino acid mimetic incorporated into 
10 a peptide or protein by an amide bond or amide bond mimetic. 

A "secondary anchor residue" is an amino acid at a position other than a primary 
anchor position in a peptide that may influence peptide binding. A secondary anchor 
residue occurs at a significantly higher frequency amongst bound peptides than would be 
expected by random distribution of amino acids at one position. The secondary anchor 
15 residues are said to occur at "secondary anchor positions." A secondary anchor residue can 
be identified as a residue which is present at a higher frequency among high or intermediate 
affinity binding peptides, or a residue otherwise associated with high or intermediate 
affinity binding. For example, analog peptides can be created by altering the presence or 
absence of particular residues in these secondary anchor positions. Such analogs are used 
20 to finely modulate the binding affinity of a peptide comprising a particular motif or 
supermotif. The terminology "fixed peptide" is sometimes used to refer to an analog 
peptide. 

"Sorting epitopes" refers to determining or designing an order of the epitopes in a 
multi-epitope construct. 

25 A "spacer" refers to a sequence that is inserted between two epitopes in a multi- 

epitope construct to prevent the occurrence of junctional epitopes and/or to increase the 
efficiency of processing. A multi-epitope construct may have one or more spacer nucleic 
acids. A spacer nucleic acid may flank each epitope nucleic acid in a construct, or the 
spacer nucleic acid to epitope nucleic acid ratio may be about 2 to 10, about 5 to 10, about 

30 6 to 10, about 7 to 10, about 8 to 10, or about 9 to 10, where a ratio of about 8 to 10 has 
been determined to yield favorable results for some constructs. 

19 



WO 02/083714 



PCT/US02/09877 



The spacer nucleic acid may encode one or more amino acids. A spacer nucleic 
acid flanking a class I HLA epitope in a multi-epitope construct is preferably between one 
and about eight amino acids in length. A spacer nucleic acid flanking a class II HLA 
epitope in a multi-epitope construct is preferably greater than five, six, seven, or more 
5 amino acids in length, and more preferably five or six amino acids in length. 

The number of spacers in a construct, the number of amino acids in a spacer, and 
the amino acid composition of a spacer can be selected to optimize epitope processing 
and/or minimize junctional epitopes. It is preferred that spacers are selected by 
concomitantly optimizing epitope processing and junctional motifs. Suitable amino acids 

10 for optimizing epitope processing are described herein. Also, suitable amino acid spacing 
for minimizing the number of junctional epitopes in a construct are described herein for 
class I and class II HLAs. For example, spacers flanking class II HLA epitopes preferably 
include G, P, and/or N residues as these are not generally known to be primary anchor 
residues (see, e.g., PCT/US00/19774). A particularly preferred spacer for flanking a class 

15 II HLA epitope includes alternating G and P residues, for example, (GP) n , (PG) n , (GP) n G, 
(PG) n P, and so forth, where n is an integer between one and ten, preferably two or about 
two, and where a specific example of such a spacer is GPGPG. A preferred spacer, 
particularly for class I HLA epitopes, comprises one, two, three or more consecutive 
alanine (A) residues (see, for example, Figure 23 A ? which depicts a spacer having three 

20 consecutive alanine residues). 

In some multi-epitope constructs, it is sufficient that each spacer nucleic acid 
encodes the same amino acid sequence. In multi-epitope constructs having two spacer 
nucleic acids encoding the same amino acid sequence, the spacer nucleic acids encoding 
those spacers may have the same or different nucleotide sequences, where different 

25 nucleotide sequences may be preferred to decrease the likelihood of unintended 
recombination events when the multi-epitope construct is inserted into cells. 

In other multi-epitope constructs, one or more of the spacer nucleic acids may 
encode different amino acid sequences. While many of the spacer nucleic acids may 
encode the same amino acid sequence in a multi-epitope construct, one, two, three, four, 

30 five or more spacer nucleic acids may encode different amino acid sequences, and it is 
possible that all of the spacer nucleic acids in a multi-epitope construct encode different 
amino acid sequences. Spacer nucleic acids may be optimized with respect to the epitope 
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nucleic acids they flank by determining whether a spacer sequence will maximize epitope 
processing and/or minimize junctional epitopes, as described herein. 

Multi-epitope constructs may be distinguished from one another according to 
whether the spacers in one construct optimize epitope processing or minimize junctional 
5 epitopes over another construct, and preferably, constructs may be distinguished where one 
construct is concomitantly optimized for epitope processing and junctional epitopes over 
the other. Computer assisted methods and in vitro and in vivo laboratory methods for 
determining whether a construct is optimized for epitope processing and junctional motifs 
are described herein. 

10 A "subdominant epitope" is an epitope which evokes little or no response upon 

immunization with whole antigens which comprise the epitope, but for which a response 
can be obtained by immunization with an isolated epitope, and this response (unlike the 
case of cryptic epitopes) is detected when whole protein is used to recall the response in 
vitro or in vivo, 

15 A "supermotif ' is an amino acid sequence for a peptide that provides binding 

specificity shared by HLA molecules encoded by two or more HLA alleles. Preferably, a 
supermotif-bearing peptide is recognized with high or intermediate affinity (as defined 
herein) by two or more HLA antigens. 

"Synthetic peptide" refers to a peptide that is not naturally occurring, but is man- 
20 made using such methods as chemical synthesis or recombinant DNA technology. 

A 'TCR contact residue" or "T cell receptor contact residue" is an amino acid 
residue in an epitope that is understood to be bound by a T cell receptor; these are defined 
herein as not being any primary MHC anchor. T cell receptor contact residues are defined 
as the position/positions in the peptide where all analogs tested induce T-cell recognition 
25 relative to that induced with a wildtype peptide. 

The term "homology," as used herein, refers to a degree of complementarity 
between two nucleotide sequences. The word "identity" may substitute for the word 
"homology" when a nucleic acid has the same nucleotide sequence as another nucleic acid. 
Sequence homology and sequence identity can also be determined by hybridization studies 
30 under high stringency and/or low stringency, and disclosed herein are nucleic acids that 
hybridize to the multi-epitope constructs under low stringency or under high stringency. 
Also, sequence homology and sequence identity can be determined by analyzing sequences 

21 



WO 02/083714 



PCT/US02/09877 



using algorithms and computer programs known in the art. Such methods be used to assess 
whether a nucleic acid is identical or homologous to the multi-epitope constructs disclosed 
herein. The invention pertains in part to nucleotide sequences having 80% or more, 85% or 
more, 90% or more, 95% or more, 97% or more, 98% or more, or 99% or more identity to 
5 the nucleotide sequence of a multi-epitope construct disclosed herein. 

As used herein, the term "stringent conditions" refers to conditions which permit 
hybridization between nucleotide sequences and the nucleotide sequences of the disclosed 
multi-epitope constructs. Suitable stringent conditions can be defined by, for example, the 
concentrations of salt or formamide in the prehybridization and hybridization solutions, or 

10 by the hybridization temperature, and are well known in the art. In particular, stringency 
can be increased by reducing the concentration of salt, increasing the concentration of 
formamide, or raising the hybridization temperature. For example, hybridization under v 
high stringency conditions coutd occur in about 50% formamide at about 37°C to 42°C. In 
particular, hybridization could occur under high stringency conditions at 42°C in 50% 

15 formamide, 5x SSPE, 0.3% SDS, and 200 |ig/ml sheared and denatured salmon sperm 

DNA or at 42°C in a solution comprising 50% formamide, 5x SSC (750mM NaCl, 75 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 ng/ml denatured, sheared salmon sperm DNA, followed by washing the 
filters in O.lx SSC at about 65°C. Hybridization could occur under reduced stringency 

20 conditions in about 35% to 25% formamide at about 30°C to 35°C. For example, reduced 
stringency conditions could occur at 35°C in 35% formamide, 5x SSPE, 0.3% SDS, and 
200 p-g/ml sheared and denatured salmon sperm DNA; The temperature range 
corresponding to a particular level of stringency can be further narrowed by calculating the 
purine to pyrimidine ratio of the nucleic acid of interest and adjusting the temperature 

25 accordingly. Variations on the above ranges and conditions are well known in the art. 

In addition to utilizing hybridization studies to assess sequence identity or sequence 
homology, known computer programs may be used to determine whether a particular 
nucleic acid is homologous to a multi-epitope construct disclosed herein. An example of 
such a program is the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 

30 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, 

Madison, WI 5371 1), and other sequence alignment programs are known in the art and may 
be utilized for determining whether two or more nucleotide sequences are homologous. 
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Bestfit uses the local homology algorithm of Smith and Waterman, Advances in Applied 
Mathematics 2: 482-489 (1981), to find the best segment of homology between two 
sequences. When using Bestfit. or any other sequence alignment program to determine 
whether a particular sequence is, for instance, 95% identical to a reference sequence, the 
5 parameters may be set such that the percentage of identity is calculated over the full length 
of the reference nucleotide sequence and that gaps in homology of up to 5% of the total 
number of nucleotides in the reference sequence are allowed. 

The nomenclature used to describe peptide compounds follows the conventional 
practice wherein the amino group is presented to the left (the N-terminus) and the carboxyl 

10 group to the right (the C-terminus) of each amino acid residue. When amino acid residue * 
positions are referred to in an epitope, they are numbered in an amino to carboxyl direction 
with position one being the position closest to the amino terminal end of the epitope, or the 
peptide or protein of which it may be a part. In the formulae representing selected specific 
embodiments of the present invention, the amino- and carboxyl-terminal groups, although 

1 5 not specifically shown, are in the form they would assume at physiologic pH values, unless 
otherwise specified. In the amino acid structure formulae, each residue is generally 
represented by standard three-letter or single-letter designations. The L-fonn of an amino 
acid residue is represented by a capital single letter or a capital first letter of a three-letter > 
symbol, and the D-form for those amino acids having D-forms is represented by a lower 

20 case single letter or a lower case three letter symbol. Glycine has no asymmetric carbon 
atom and is simply referred to as "Gly" or G. Symbols for the amino acids are shown 
below. 

Single Letter Symbol Three Letter Symbol Amino Acids 



A 


Ala 


Alanine 


C 


Cys 


Cysteine 


D 


Asp 


Aspartic Acid 


E 


Glu 


Glutamic Acid 


F 


Phe 


Phenylalanine 


G 


Gly 


Glycine 


H 


His 


Histidine 


I 


Ee 


Isoleucine 


K 


Lys 


Lysine 


L 


Leu 


Leucine 


M 


Met 


Methionine 


N 


Asn 


Asparagine 


P 


Pro 


Proline 
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Single Letter Symbol Three Letter Symbol Amino Acids 



Q 
R 
S 
T 
V 
W 
Y 



Gin 
Arg 
Ser 
Thr 
Val 
Trp 

Jyr. 



Glutamine 

Arginine 

Serine 

Threonine 

Valine 

^Tryptophan 

Tyrosine 



Amino acid "chemical characteristics" are defined as: Aromatic (F,W, Y); 
Aliphatic-hydrophobic (L, I, V, M); Small polar (S, T, C); Large polar (Q, N); Acidic (D, : 
E); Basic (R, H, K); Proline; Alanine; and Glycine. 



Acronyms used herein are as follows: 



10 



15 



20 



25 



APC 
CD3 
CD4 
CD8 
CEA 
CFA 
CTL: 
DC: 



DMSO: 

ELISA: 

E:T: 

FCS: 

G-CSF: 

GM-CSF: 

HBV: 



Antigen presenting cell 

Pan T cell marker 

Helper T lymphocyte marker 

Cytotoxic T lymphocyte marker 

Carcinoembryonic antigen 

Complete Freund's Adjuvant 

Cytotoxic T lymphocytes 

Dendritic cells. DC functioned as potent antigen presenting cells 

by stimulating cytokine release from CTL lines that were specific 

for a model peptide derived from hepatitis B virus (HBV). In 

vitro experiments using DC pulsed ex vivo with an HBV peptide 

epitope have stimulated CTL immune responses in vitro 

following delivery to naive mice. 

Dimethylsulfoxide 

Enzyme-linked immunosorbant assay 

Effector:target ratio 

Fetal calf serum 

Granulocyte colony-stimulating factor 

Granulocyte-macrophage (monocyte)-colony stimulating factor 
Hepatitis B virus 
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HER2/Neu: c-erbB-2 

HLA: Human leukocyte antigen 

HLA-DR: Human leukocyte antigen class II 

HPLC: High Performance Liquid Chromatography 

5 HTC: Helper T cells 

HTL: Helper T Lymphocyte 

ID: Identity 

IF A: Incomplete Freund's Adjuvant 

IFNy: Interferon gamma 

10 IL-4: Interleukin-4 cytokine 

IV: Intravenous 

LU 30 %: Cytotoxic activity required to achieve 30% lysis at a 1 00: 1 

(E:T) ratio 

MAb: Monoclonal antibody 

1 5 MAGE: Melanoma antigen 

MLR: Mixed lymphocyte reaction 

MNC: Mononuclear cells 

PB: Peripheral blood 

PBMC: Peripheral blood mononuclear cell 

20 SC: Subcutaneous 

S .E.M. : Standard error of the mean 

QD : Once a day dosing 

TAA: Tumor associated antigen 

TCR: T cell receptor 

25 TNF: Tumor necrosis factor 

WBC: White blood cells 



This application may be relevant to U.S.S.N. 09/189,702 filed 1 1/10/98, which is a 
CIP of U.S. S.N 08/205,713 filed 3/4/94, which is a CEP of 08/159,184 filed 11/29/93 and 
now abandoned, which is a CIP of 08/073,205 filed 6/4/93 and now abandoned, which is a 
30 CIP of 08/027,146 filed 3/5/93 and now abandoned. The present application is also related 
to U.S.S.N. 09/226,775, which is a CIP of U.S.S.N. 08/815,396, which claims the benefit of 
U.S. S.N. 60/013,1 13, now abandoned. Furthermore, the present application is related to 
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U.S.S.N. 09/017,735, which is a CIP of abandoned U.S.S.N. 08/589,108; U.S.S.N. 
08/753,622, U.S.S.N. 08/822,382, abandoned U.S.S.N. 60/013,980, U.S.S.N. 08/454,033, 
U.S.S.N. 09/1 16,424, and U.S.S.N. 08/349,177. The present application is also related to 
U.S.S.N. 09/017,524, U.S.S.N. 08/821,739, abandoned U.S.S.N. 60/013,833, U.S.S;N. 
5 08/758,409, U.S.S.N. 08/589,107, U.S.S.N. 08/451,913, U.S.S.N. 08/186,266, U.S.S.N. 
09/1 16,061, and U.S.S.N. 08/347,610, which is a CEP of U.S.S.N. 08/159,339, which is a 
CIP of abandoned U.S.S.N. 08/103,396, which is a CIP of abandoned U.S.S.N. 08/027,746* 
which is a CIP of abandoned U.S.S.N. 07/926,666. The present application may also be 
relevant to U.S.S.N. 09/017,743, U.S.S.N. 08/753,615; U.S.S.N. 08/590,298, U.S.S.N. 

10 09/1 15,400, and U.S.S.N. 08/452,843, which is a CEP of U.S.S.N. 08/344,824, which is a 
CEP of abandoned US.S.N. 08/278,634. The present application may also be related to 
provisional U.S.S.N. 60/087,192 and U.S.S.N. 09/009,953, which is a CEP of abandoned 
U.S.S.N. 60/036,713 and abandoned U.S.S.N. 60/037,432. In addition, the present 
application may be relevant to U.S.S.N. 09/098,584, and U.S.S.N. 09/239,043. The present 

15 application may also be relevant to co-pending U.S.S.N. 09/583,200 filed 5/30/00, U.S.S.N. 
09/260,714 filed 3/1/99, and U.S. Provisional Application "Heteroclitic Analogs And 
Related Methods," Attorney Docket Number 018623-015810US filed 10/6/00. All of the 
above applications are incorporated herein by reference. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates data on three different multi-epitope constructs, incorporating 20 
to 25 different CTL epitopes each. 

Figure 2 illustrates two different synthetic polypeptides (Fig. 2a) where the first 
construct incorporates four different epitopes linearly cosynthetized, and the second 
25 construct incorporates a GPGPG spacer. Fig. 2b illustrates the capacity of 2 nanomoles of 
these different constructs to prime for proliferative responses to the various epitopes in IA b 
positive mice, compared to the responses induced by equimolar amounts of a pool of the 
same peptides (3 micrograms of each peptide). 

Figure 3 depicts the structure of multi-epitope DNA constructs. The HLA restriction 
30 is shown above each epitope, the A*0201 epitopes are bolded. The HLA binding affinity 
(IC 5 o nM) is provided below each epitope, (a) Schematic of HIV-FT illustrating order of the 
encoded epitopes, (b) Schematics of the of the HBV-specific constructs. The C+l amino 
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acid relative to Core 18 is indicated with an arrow. The HBV-specific constructs with 
single amino acid insertions at the Ci position of Core 18 are illustrated as HBV. IX. 

Figure 4 illustrates the immunogenicity of the HLA-A*0201 epitopes in HTV-FT in 
HLA-A*0201/K b transgenic mice, (a) Representative CTL responses against epitopes Pol 
5 498 (circles), Vpr 62 (triangle), Gag 386 (squares). Cytotoxicity was assayed in a 51 Cr 

release assay against Jurkat-HLA-A*0201/K b target cells in the presence (filled symbols) or 
absence (open symbols) of each peptide, (b) Summary of CTL responses of 
immunogenicity of HTV-FT in HLA-A*0201/K b transgenic mice. Bars indicate the 
geometric mean CTL response of positive cultures. The frequency of positive CTL 

10 cultures is also indicated. 

. Figure 5 shows the influence of the C+l amino acid on epitope immunogenicity. A 
database incorporating CTL responses from a variety of multi-epitope constructs 
representing 94 epitope/C+1 amino acid combinations was analyzed to determine the 
frequency (%) of instances in which a particular combination was associated with an 

15 optimal CTL response. CTL responses were considered optimal if greater than 100 SU or 
20 LU in at least 30% of the cultures measured. The number of times a given epitope/C+1 
amino acid combination was observed is also provided. 

Figure 6 shows CTL responses to HBV-specific constructs (a) CTL responses to 
Core 18 epitope following DNA immunization of HLA-A*0201/K b transgenic mice, (b) 

20 CTL responses to HBV Core 1 8 following DNA immunization of HLA-A*0201/K b 

transgenic mice with constructs which vary by a single amino acid insertion at the C+l . 
position of Core 18. 

Figure 7 shows levels of HBV Core 18 presentation in HBV.l (shaded bars) and 
HBV. IK (hatched bars) transfected cell lines. Epitope presentation was quantified using 
25 peptide-specific CTL lines. Presentation of HBV Pol 455 is shown for comparative 
purposes. 

Figure 8 depicts data for 221 A2K b target cells transfected with the HIV-FT 
EpiGene™. These transfected cells were assayed for their capacity to present epitopes to 
CTL lines derived from HLA transgenic mice and specific for various HTV-derived CTL 
30 epitopes. To correct for differences in antigen sensitivity of different CTL lines, peptide 
dose titrations, using untransfected cells as APC, were run in parallel. 



27 



WO 02/083714 



PCTYUS02/09877 



Figure 9 shows HIV multi-epitope constructs optimized using the methods of the 
present invention. 

Figure 10 illustrates a computer system for performing automatic optimization of 
multi-epitope constructs in accordance with one embodiment of the invention. 
5 Figures 1 1 A-B illustrate an exemplary input text file containing user input 

parameters used for executing a Junctional Analyzer program, in accordance with one 
embodiment of the invention. 

Figure 12 illustrates a flow chart diagram of a software program for identifying 
optimal multi-epitope constructs, in accordance with one embodiment of the invention. 
1 0 Figures 1 3 A-D illustrate an exemplary output text file containing output results of a 

Junctional Analyzer program, in accordance with one embodiment of the invention. 

Figure 14A depicts CTL responses induced by EP-HTV-90 relative to individual 
peptides in DFA, and Figure 14B depicts CTL responses induced by PfCTL.l, PfCTL.2, and 
PfCTL.3 relative to individual peptides. 
15 Figure 15 shows the effect of GPGPG spacers in class II epitope constructs HIV 

75mer and HIV 60mer on HTL responses to particular epitopes. 

Figure 16 depicts HTL responses to particular epitopes present in the EP-HIV- 
1043 -PADRE® construct. 

Figure 17 is a schematic depicting the epitopes present in HIV 75mer, EP-HIV- 
20 1043, and EP-JEV-1043-PADRE® 

Figures 18A-N show the amino acid sequences and nucleic acid sequences of 
certain multi-epitope constructs. 

Figures 19A-D show the amino acid sequences for epitopes present in certain multi- 
epitope constructs. 

25 Figures 20A-20F show the HBV CTL epitopes used to construct three related Epi 

Genes™, HBV-2, HBV-2 A and HBV-2B, the order of epitopes in the EpiGenes™, the 
immune responses induced in HLA-A2 or HLA-A3/1 1 transgenic mice and the amino acid 
and nucleic acid sequences of the EpiGenes™. In Figure 20B, the signal sequence in 
HBV-2, HBV-2A and HBV-2B is the IgK consensus signal sequence, although other signal 

30 sequences could be utilized. 

Figures 21A-21E show the HBV CTL epitopes used to construct two 21 CTL 
epitope EpiGenes™, HBV-21 A and HBV-21B, the order of epitopes in the EpiGenes™, 
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the immune responses induced in HLA-A2 or HLA-A3/1 1 transgenic mice and the amino 
acid and nucleic acid sequences of the EpiGenes™. 

Figures 22A-22E show the HBV CTL epitopes used to construct two 30 CTL 
. epitope EpiGenes™, HBV-30B and HBV-30C, the order of epitopes in the EpiGenes™, 
5 the immune responses induced in HLA-A2 or HLA-A3/1 1 transgenic mice and the amino 
acid and nucleic acid sequences of the EpiGenes™. 

Figures 23 A-23C show the modifications made to spacers flanking two HLA-A2 
restricted CTL epitopes in the HBV-30C EpiGene™. Modifications were designed to 
increase the efficiency of processing and subsequent presentation and thus, increase 
10 immunogenicity of the epitopes. Immunogenicity was measured using HLA-A2 or HLA- 
A3/1 1 transgenic mice, and the amino acid and nucleic acid sequences of the EpiGene™ 
are noted. In Figure 23 A, the lysine (K) spacer flanking the Core 1 8 epitope in HB V-30C 
were modified to include three alanine residues (AAA) in HBV-30CL. Also, one 
asparagine (N) spacer flanking env 183 epitope in HBV-30C was modified to include three 
15 alanine, residues (AAA) in HBV-30CL. 

Figures 24A-24C show HTL epitopes, and their binding affinity to selected HLA- 
DR alleles, used to construct a multi-epitope vaccine comprising HTL epitopes separated 
by GPGPG amino acid spacers. The nucleic acid sequence of the multi-epitope vaccine 
and the amino acid sequence encoded by the nucleic acid are shown in Figure 24C. 

20 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The invention is described in detail below with reference to the figures wherein like 
elements are referenced with like numerals throughout. The invention relates to a method 
and system of designing multi-epitope vaccines with optimized immunogenicity. In 

25 preferred embodiments, the vaccine comprises CTL and HTL epitopes. Vaccines in 
accordance with the invention allow for significant, non-ethnically biased population 
coverage, and can preferably focus on epitopes conserved amongst different viral or other 
antigenic isolates. Through the method and system disclosed herein, vaccines can be 
optimized with regard to the magnitude and breadth of responses, and can allow for the 

30 simplest epitope configuration. Finally, general methods are provided to evaluate 
immunogenicity of a multi-epitope vaccine in humans. 
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The method of the invention comprises designing a multi-epitope construct based 
on principles identified herein. In one aspect, the invention provides for simultaneous 
induction of responses against specific CTL and HTL epitopes, using single promoter 
multi-epitope constructs. Such constructs can contain many different epitopes, preferably 
5 greater than 10, often greater than 20, 25, 30, 25, 40, 45, 50, 55, 60, 65, 70, or more. 

In a preferred embodiment, a computer system identifies one or more optimal 
multi-epitope constructs by performing the following functions and/or analyses: 

(i) the epitopes to be incorporated into the multi-epitope construct are sorted to 
provide an order that minimizes the number of junctional epitopes formed. A more 

10 detailed discussion of this sorting procedure is provided below with reference to Figures 1 1 
and 12. Preferably, as a secondary consideration in ordering epitopes, epitopes are 
positioned such that residues at the N-terminus of an epitope that promote CTL 
immunogenicity are juxtaposed to the C-terminus of another CTL epitope. 

(ii) flanking residues that enhance immunogenicity may be inserted at the flanking 
15 positions of epitopes. In particular embodiments, flanking residues are inserted at the C+l 

position of CTL epitopes. 

(iii) spacer sequences may be inserted between epitopes to prevent occurance of 
junctional epitopes. In particular embodiments, the spacer sequences can also include a 
residue that promotes immunogenicity at the N-terminus of the linker such that the residue 

20 flanks the C-terminus of a CTL epitope. 

In particular embodiments to prevent HTL junctional epitopes, a spacer composed 
of amino acid residues that do not correspond to any known HLA Class II anchor residue, 
are used, e.g, alternating G and P residues (a GP spacer) is included between two HTL 
epitopes. 

25 Another aspect of the invention, (consideration (ii) above) involves the 

introduction or substitution of particular amino acid residues at positions that flank >■ 
epitopes, e.g., a position immediately adjacent to the C-terminus of the epitope, thereby 
generating multi-epitope constructs with enhanced antigenicity and immunogenicity 
compared to constructs that do not contain the particular residue introduced or substituted 

30 at that site, i.e., non-optimized multi-epitope constructs. The methods of optimizing multi- 
epitope constructs comprise a step of introducing a flanking residue, preferably K, N, G, R, 
or A at the C+l position of the epitope, i.e., the position immediately adjacent to the C- 
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terminus of the epitope. In an alternative embodiment, residues that contribute to 
decreased immunogenicity, i.e., negatively charged residues, e.g., D, aliphatic residues (I, 
L, M, V) or aromatic non-trytophan residues, are replaced. The flanking residue can be 
introduced by positioning appropriate epitopes to provide the favorable flanking residue, or 
5 by inserting a specific residue. 

As noted in the background section, multi-epitope constructs (minigenes) encoding 
up to 10 epitopes have been used to induce responses against a number of different 
epitopes. The data relating to an experimental multi-epitope construct, pMin .1 has been 
published (Ishioka et al., J Immunol Vol. 162(7):39 15-25 (1999)). Disclosed herein, are 

1 0 parameters for designing and evaluating multi-epitope constructs with optimized 
immunogenicity that address myriad disease indications of interest. 

Design parameters were identified based on a number of studies. In a preliminary 
evaluation of multi-epitope constructs, data on three different multi-epitope constructs, 
incorporating 20 to 25 different CTL epitopes each, are presented (Fig. 1). One construct is 

15 based on HIV-derived epitopes, (HIV-1), while the other two incorporate HCV-derived 
epitopes (HCV1 and HCV2, respectively). The immunogenicity of these different multi- 
epitope constructs has been measured in either A2 or Al 1 HLA transgenic mice (Al, A24 
and B7 restricted epitopes were not evaluated). 

Thus, eleven days after a single i.m. DNA vaccine injection, responses against 8 to 

20 14 different representative epitopes were evaluated following a single six day in vitro 

restimulation, utilizing assays to measure CTL activity (either chromium release or in situ 
IFN production, as described herein). Priming of epitope specific CTL could be 
demonstrated for 6/8 (75%), 10/14 (72%) and 13/14 (93%) of the epitopes tested in the case 
of HIV-1, HCV1 and HCV2, respectively. Thus, multi-epitope constructs, capable of 

25 simultaneously priming CTL responses against a large number of epitopes, can be readily 
designed. However, it should be emphasized that CTL priming for some epitopes was not 
detected and, in several of the 36 cases considered, responses were infrequent, or varied 
significantly in magnitude over at least/three orders of magnitude (1000-fold). These 
results strongly suggested that a more careful analysis and optimization of the multi-epitope 

30 constructs was required. 

The possibility that the suboptimal performance of priming for certain epitopes 
might be related to multi-epitope construct size was also examined. In fact, most of the 
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published reports describe multi-epitope construct of up to ten epitopes, and the few 
instances in which 20-epitope constructs have been reported, activity directed against only 
two or three epitopes was measured. To address this possibility, two smaller EpiGenes™ 
(HTV-1.1 and HTV-1.2) each encompassing ten epitopes, and corresponding to one half of 
5 the HIV-1 EpiGene™, were synthesized and tested. Responses against four representative 
epitopes were measured. 

Table 1. hnmunogenicitv appears to be independent of EpiGene™ size . 

CTL response to different EpiGenes™ 
HIV1 ,HIV1.1 HIV 1.2 

CTL (20mer) (lOmer) (lOmer) 

Epitope Frequency 1 * Magnitude 2 Frequency Magnitude Frequency Magnitude 



Pol 774 


0/8 


* 


0/4 


* 


NA 3; 


NA 


Pol 498 


18/19 


46.7 


4/4 


16.4 


NA 


NA 


Gag 271 


4/13 


4.0 


NA 


NA 


0/4 


* 


Env 134 


5/8 


16.1 


NA 


NA 


4/4 


14.8 



10 1 ) Represents the fraction of independent cultures yielding positive responses 

2) Lytic Units (LU) 

3) Not Applicable 

It was found that the responses induced by the smaller EpiGenes™ were 
15 comparable, and if anything, lower than those induced by the twenty-epitope construct . 

(Table 1). Accordingly, factors relating to EpiGene™ size are unlikely explanations for the; 
observed suboptimal priming to certain epitopes and thus other parameters, disclosed 
herein, are used to design efficacious multi-epitope constructs. 

20 The minimization of junctional motifs 

One of the considerations in designing multi-epitope constructs is the inadvertent 
creation of junctional epitopes when placing epitopes adjacent to each other. The presence 
of such epitopes in a multi-epitope construct could significantly affect performance. 
Strategies to guard against this undesired effect are disclosed herein for application to the 

25 development of multi-epitope vaccines. Junctional epitopes can first be minimized by 
sorting the epitopes to identify an order in which the numbers of junctional epitopes is 
minimized. Such a sorting procedure can be performed using a computer or by eye, if 
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necessary, or depending on the number of epitopes to be included in the multi-epitope 
construct. 

For example, a computer program that finds patterns, e.g., Panorama, manufactured 
by ProVUE Development, Huntington Beach, California, U.S.A., can be used in 
5 accordance with one embodiment of the invention. A very large number of different 
epitope arrangements can be considered in designing a particular multi-epitope construct. 
A computer program accepts as input, the particular set of epitopes considered, and the 
motifs to be scanned in order to evaluate whether there are any junctional epitopes bearing 
these motifs. For example, a program can simulate building a multi-epitope construct, and 

10 in an euristic computational algorithm, examine epitope pairs to avoid or minimize the 
occurrance of junctional motifs. The program can for example, evaluate 6 X 10 5 (about 
half a million) multi-epitope construct configurations/second. 

A complete analysis of a 10-epitope construct using a computer program as 
described in the preceding paragraph requires examining 10 factorial = 3.6X10 6 

15 combinations and can be completed in six seconds. A fourteen-epitope construct can be 
completely analyzed in a couple of days. Thus, analysis time goes up very rapidly as larger 
constructs are considered. However, a complete analysis is not always required and the 
program can be run for any desired length of time. In either case, the computer system of 
the present invention identifies and provides at least one configuration having a minimum 

20 number of junctional epitopes. 

An example of the results of this type of approach is presented in Table 2. The 
number of junctional motifs in ten different random assortments of the same epitopes 
contained in the HCV1 EpiGene™, which incorporates 25 epitopes, and is the result of a 
two day computer analysis, is presented in this Table. In the non-optimized assortments, a 

25 large number of A2, Al 1 and K b motifs were found, in the 25 to 38 range, with an average 
of 31. By comparison, only two such junctional motifs are present in the HCV1 EpiGene™ 
assortment. In conclusion, a computer program can be utilized to effectively minimize the 
number of junctional motifs present in multi-epitope constructs. 



Table 2. Occurrence of junctional epitopes. 



EpiGene™ 


selection criteria 


junctional motifs 


HCV.a 


random 


33 
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bcjccuon cntcud 


J UilvAlUIloI lIIULJJLo 


XI V . V 


random 




XI I— V .c 


random 


zo 


xi V-/ v .a 


ranaom 


97 
z / 


xiv_y v .e 


random 




XlC V .1 


random 


ZO 


TTPV a 
XH^ V .g 


■t*oTi /I r\tY^ 

ranuom 


-3C 
JO 


HCV.h 


random 


33 


HCV.i 


random 


33 


HCV.j 


random 


34 


HCV.I 


minimized 


2 



Eliminating Class II junctional epitopes and testing for Class II restricted responses in vivo 

As a further element in eliminating junctional epitopes, spacer sequences can be 
inserted between two epitopes that create a junctional epitope when juxtaposed. 
5 In one embodiment, to correct the problem of junctional epitopes for HTL epitopes, 

a spacer of, for example, five amino acids in length is inserted between the two epitopes. 
The amino acid residues incorporated into such a spacer are preferably those amino acid 
residues that are not known to be primary anchor residues for any of the HLA Class II 
binding motifs. Such residues include G, P, and N. In a preferred embodiment, a spacer 

10 with the sequence GPGPG is inserted between two epitopes. Previous work has 

demonstrated that the GP spacer is particularly effective in disrupting Class II binding 
interactions (Sette et al., J. Immunol, 143:1268-73 (1989)). All known human Class II 
binding motifs and the mouse IA b (the Class II expressed by HLA transgenic mice) do not 
tolerate either G or P at this main anchor positions, which are spaced four residues apart. 

15 This approach virtually guarantees that no Class II restricted epitopes can be formed as 
junctional epitopes. 

hi an example validating this design consideration, we synthesized polypeptides 
incorporating HTV-derived HTL epitopes. These epitopes are broadly cross-reactive HLA 
DR binding epitopes. It was then determined that these epitopes also efficiently bind the 
20 murine IA b Class II molecule. A diagram illustrating the two different synthetic 
polypeptides considered is shown in Fig. 2a. 



34 



WO 02/083714 



PCT/US02/09877 



The first construct incorporates four different epitopes linearly arranged, while the 
second construct incorporates the GPGPG spacer. Synthetic peptides corresponding to the 
three potential junctional epitopes were also synthesized. 

The capacity of 2 nanomoles of these different constructs to prime for proliferative 
5 responses to the various epitopes in IA b positive mice was tested, and compared to the 
responses induced by equimolar amounts of a pool of the same peptides (3 micrograms of 
each peptide). Specifically, groups of 3 mice were injected with peptides in CFA 
emulsions, 1 1 days after injection their lymph node cells were cultured in vitro for an 
additional 3 days, and thymidine incorporation was measured in the last 24 hours of 

10 culture. It was found (Fig. 2b) that, as predicted on the basis of their high affinity IA b 

binding capacity, all four epitopes induced good proliferation responses. Stimulation index 
(SI) values in the 4.9 to 17.9 range were observed when these peptides were injected in a 
pool. However, when the linear polypeptide incorporating the same epitopes was tested, 
the response directed against Pol 335 was lost. This was associated with appearance of a 

15 response directed against a junctional epitope spanning Gag 171 and Pol 335. The use of 
the GPGPG spacer eliminated this problem, presumably by destroying the junctional 
epitope, and the Pol 335 response was regained. The responses observed were of 
magnitude similar to those observed with the pool of isolated peptides. 

These results demonstrate that responses against multiple HIV-derived Class II 

20 epitopes can be simultaneously induced, and also illustrate how IA b /DR crossreactivity can 
be utilized to investigate the immunogenicity of various constructs incorporating HTL 
epitope candidates. Finally, they demonstrate that appropriate spacers can be employed to 
effectively disrupt Class II junctional epitopes that would otherwise interfere with effective 
vaccine immunogenicity. 

25 In the case of Class I restricted responses, one case of a naturally occurring 

junctional epitope and the consequent inhibition of epitope specific responses has been 
presented by McMichael and coworkers (Tussey et aL, Immunity, Vol. 3(l):65-77 (1995)). 
To address the problem of junctional epitopes for Class I, similar analyses can be 
performed. For example, a specific computer program is employed to identify potential 

30 Class I restricted junctional epitopes, by screening for selected murine motifs and for the 
most common human Class I HLA A and B motifs. 
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Spacer sequences can also similarly be employed to prevent CTL junctional 
epitopes. Often, very small residues such as A or G are preferred spacer residues. G also 
occurs relatively infrequently as a preferred primary anchor residue (see, e.g., 
PCT/USOO/24802) of an HLA Class I binding motif These spacers can vary in length, e.g. 
spacer sequences can typically be 1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues in length and 
are sometimes longer. Smaller lengths are often preferred because of physical constraints 
in producing the multi-epitope construct. 

The influence of flanking regions on CTL multi-epitope construct immunogenicity 

Another factor to be considered in designing multi-epitope constructs is to insert 
residues that favor immunogenicity at the position flanking the C-terminus of a CTL 
epitope. 

Disclosed herein are studies that identify residues that increase immunogenicity 
and, accordingly, residues that are inserted in multi-epitope constructs to optimize 
immunogenicity. 

The molecular context in which an epitope was expressed often dramatically 
influenced the frequency and/or magnitude of priming of CTL specific for that epitope in 
HLA transgenic mice. Two examples are shown in Table 3. . 
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Table 3. Differences in effectiveness of T cell priming for specific epitopes in different 
EpiGenes™. 







Flanking 
Sequence 


Epitope 


Flanking 
Sequence 


Immune 
Response 


Immune 
Response 


Epitope 
Identity 


EpiGene 

TM 

Identity 


(N terminus) 


Sequence 


(C-terminus) 


Frequency Magnitude 0 


Core 18 


HBV.l 


TLKAAA 


FLPSDFFPSV 


FLLSLG 


6/6 


5.5 




pMinl 


TLKAAA 


FLPSDFFPSV 


KLTPLC 


6/6 


1074.5 


Core 132 


HCV1 


ILGGWV 


DLMGYEPLV 


"YLVAYQ 


2/12 


107.7- 




HCV2 


VPGSRG 


DLMGYDPLV 


AKFVA 


17/18 


929.2 



1) IFNy secretory units 

The immunogenicity of the HB V Core 1 8 epitope expressed in the pMin5 
EpiGene™ was approximately 200-fold lower in magnitude than that observed in the case 
of the pMinl EpiGene™. Similarly, the immunogenicity of the HCV Core 132 epitope 
expressed in the context of the HCV1 EpiGene™ was marginal, with significant T cell 
priming demonstrable in only 2 of 12 different independent CTL experiments/cultures 
performed. These two positive experiments yielded responses of approximately 100SU of 
IFNy. However, when the same epitope was expressed in the context of the HCV2 
EpiGene™, positive responses were observed in 17/18 cases, and with average magnitudes 
approximately five-fold higher. 



Immunogenicity of HIV-FT in HLA-A*0201/Kb transgenic mice 
15 An HIV multi-epitope DNA vaccine, HIV-FT (Fig. 3a) encodes 20 HIV-derived 

CTL epitopes. Of these 20 epitopes, eight are restricted by HLA-A*0201, nine by HLA- 
A*l 101 and three by HLA-B*0702. All epitopes bound their relevant restriction element 
with high or moderate affinity. All of the HLA-A*0201 restricted epitopes bound purified 
HLA-A*0201 molecules with roughly similar affinities, with IC 5 o values in the 19-192 nM 
20 range (Fig. 3a). The HLA-A*0201 epitopes chosen for inclusion in HIV-FT are recognized 
in HIV-1 infected individuals and were also highly effective in priming for recall CTL 
responses when emulsified with IFA and utilized to prime HLA-A*0201/K b transgenic 
mice. The construct was designed to encode the epitopes sequentially without any 
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intervening spacer sequences between them and a consensus IgK signal sequence was fused 
to the 5' end of the construct to facilitate transport of the encoded antigen into the 
endoplasmic reticulum (Ishioka et al., J. Immunol. 162:3915-3925, 1999). 

The ability of HIV-FT to prime recall CTL responses in vivo was evaluated by 
5 intramuscular immunization of HLA-A*0201/K b transgenic mice. Splenocytes from 

animals immunized with lOOjig of HIV-FT plasmid DNA were stimulated with each of the 
HLA-A*0201 epitopes encoded in HIV-FT and assayed for peptide-specific CTL activity r 
after six days of culture. Representative CTL responses against three of the epitopes in 
HIV-FT are shown in Fig. 4a. To more conveniently compile results from different 

10 experiments the percent cytotoxicity values for each splenocyte culture were expressed in 
lytic units (Vitiello, et al., J. Clin. Invest 95:341-349, 1995). Of the eight HLA-A*0201 
restricted epitopes encoded in HIV-FT, Pol 498, Env 134, Pol 448, Vpr 62, Nef 221, and 
Gag 271, primed for CTL responses following DNA immunization, (Fig. 4b). The 
magnitude of the CTL responses varied over greater than a 10-fold range, from as high as 

15 nearly 50 LU against Pol 498, too as little as 4 LU against Nef 221 and Gag 271 . Similarly, 
the frequency of recall CTL responses varied between epitopes, with the Pol 498 epitope 
inducing responses in 94% of the experiments while CTL responses to Gag 271 were 
detected in only 31% of the experiments. In conclusion, DNA immunization with HIV-FT, 
which sequentially encodes the epitopes without any spacer amino acids, induced recall 

20 CTL responses against the majority of the epitopes analyzed. However, the magnitude and 
the frequency of the responses varied greatly between epitopes. 

Correlation Between Epitope Immunogenicitv and Levels of HIV-FT Epitope 
Presentation In Transfected Cell Lines 

The differential immunogenicity of the HLA-A*0201 epitopes in HIV-FT was then 
25 assessed. Differential MHC binding affinity could be excluded as all of the epitopes bind 
HLA-A*0201 with high affinity (Fig. 3a). In addition, lack of a suitable repertoire of TCR 
specificities in HLA-A*0201/K b transgenic mice could be excluded since all epitopes 
yielded comparable CTL responses following immunization of HLA transgenic mice with 
the optimal preprocessed peptide emulsified in IFA. Variations in the relative amounts of 
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each epitope presented for T cell recognition may account for the differences in epitope 
immunogenicity. 

To test this, Jurkat cells, a human T cell line, expressing the HLA-A*0201/K b gene 
(Vitiello et al., J. Exp, Med. 173, 1007-1015, 1991) were transfected with the HTV-FT 
expressed in an episomal vector. A human cell line was selected for use to eliminate any 
possible artifacts that may be associated with potential differences in the processing - 
capabilities between humans and mice. This transfected cell line matches the human MHC 
presentation with human antigen processing capabilities and provides support for the 
subsequent development of CTL epitope-based DNA vaccines for use in humans. 

Peptide-specific CTL lines detected presentation in the transfected targets of four of 
the HLA-A*0201 epitopes encoded in the HIV-FT, Pol 498, Env 134, Pol 448 and Nef 221. 
To quantitate the level at which each of these epitopes was produced and presented, the 
CTL lines specific for the various epitopes were incubated with untransfected targets and 
variable amounts of each epitope or peptides. These CTL dose response curves were 
utilized as standard curves to determine the peptide concentration inducing levels of IFNy 
secretion equivalent to those observed in response to the HIV-FT transfected target cells. 
This value is referred to as a "peptide equivalent dose" and taken as a relative measure of 
the amount of epitope presented on the transfected cell. 

Table 4 summarizes the findings of this analysis for eight of the HLA-A*0201 
epitopes encoded in the HIV-FT. Peptide equivalent doses varied from a high of 33.3 
ng/ml for Nef 221 to less than 0.4 ng/ml peptide equivalents for epitopes Gag 271, Gag 386 
and Pol 774. Cumulatively these results indicate that in human cells lines transfected with. 
HIV-FT there is at least a 100-fold variation exists in the levels of presentation of the 
different HLA-A*0201 restricted epitopes. All of the epitopes that were presented at 
detectable levels in antigenicity assays were also immunogenic in vivo. The only epitope 
that was immunogenic and not antigenic was Gag 271. In this case, immunization of HLA- 
A*0201/Kb transgenic mice with HIV-FT induced a weak CTL response in less than a third 
of the cultures tested. The other two, epitopes, which were presented below the limit of 
sensitivity for the antigenicity analysis, Gag 386 and Pol 774, were non-immunogenic. In 
conclusion these results suggest that the heterogeneity in CTL responses induced by HTV- 
FT immunization can at least in part be attributed to suboptimal epitope presentation. 
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Table 4: Comparison of HIV-FT immunogenicity and antigenicity 



Epitope 


HIV-FT Immunogenicity 


HIV-FT Antigenicity 




magnitude 


n 2 

frequency^ 


Peptide 

.1.3 

Equivalents 


4 

n 


Pol 498 


58.8 (2.2) 


94% (16/17) 


23.8 (2.0) 


4 


Env 134 


i a 1 r\\ 
lo. 1 p.UJ 


/TOO/ /^/Q\ 


f. "> /i 

O.z ll.ZJ 




rui *T*ro 


15.7(2.6) 


54% (7/13) 


24.7 (3.9) 


3 


Vpr62 


9.9 (1.9) 


83% (10.12) 


ND 




Nef 221 


4.4 (1.3) 


78% (7/9) 


33.3 (6.0) 


3 


Gag 271 


4.0 (1.4) 


31% (4/13) 


<0.4 


6 


Gag 386 


0 


■ 0%(0/17) 


<0.4 


3 


Pol 774 


0 


0% (0/8) 


<0.4 


1 



1 magnitude expressed as LU (ref); the correlation coefficient relative to peptide 
equivalents R+0.44 

2 frequency of positive cultures (number cultures >2LU/total tested); the correlation 
5 coefficient relative to peptide equivalents R+0.8. 

3 magnitude expressed in ng/ml 

4 number of independent experiments 

Flanking amino acids influence CTL epitope immunogenicity in vivo following vaccination 
10 As described herein, the particular amino acids flanking individual CTL epitopes is 

one factor that influences or enhances the efficiency with which an epitope is processed by 
altering the susceptibility of the antigen to proteolytic cleavage. To examine the influence 
of flanking amino acids on epitope immunogenicity, immunogenicity data was obtained 
from HLA-A*0201, -A*l 101 and -B*0701 transgenic mice immunized with a number of 
15 unrelated experimental multi-epitope DNA constructs encoding minimal CTL epitopes 
without intervening sequences. A database representing 94 different epitope/flanking 
residue combinations was compiled to determine the possible influence the immediately 
flanking amino acids on epitope immunogenicity. A given epitope and flanking amino acid 
combination was included only once to prevent artificial skewing of the analysis because of 
20 redundancies. Epitope immunogenicity in HLA transgenic was considered optimal if 
greater than 100 SU or 20 LU in at least 30% of the cultures measured. CTL responses 
were typically scored in one of four categories: (+++), outstanding-more than 200 LU or 
1000 SU; (++), good-20-200 LU or 100-1000 SU; (+), intermediate-2 to 20 LU or 10 to 
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100 SU; and (+/-), weak or negative-less than 2 LU or 10 SU. The numbers of optimal 
versus sub-optimal responses were categorized based on the chemical type of amino acid in 
the flanking positions and the significance of differences were determined using a chi- 
square test. 

5 This analysis did not find any associations between the type of amino acids present ' 

at the amino-terminus of an epitope and immunogenicity. However, significant effects of 
the carboxyl-terminus flanking residue, the C+l residue, were identified. Positively 
charged amino acids, K or R were most frequently associated with optimal CTL responses, 
a frequency of 68% (Fig 5). The presence of amino acids N and Q at the C+l residue was « 

10 also associated with strong CTL responses in 55.5% of the cases examined; when epitopes 
were flanked at the C+l position by N, they induced optimal CTL responses in 3/4 cases. 
In general, small residues such as C, G, A, T, and S promoted intermediate CTL responses 
inducing strong responses in 54% of the combinations available for analysis. Conversely, 
epitopes flanked by aromatic and aliphatic amino acids induced optimal in vivo responses 

15 in only 36% and 17% of the cases, respectively. The negatively charged residue, D, 
yielded a suboptimal CTL response. The influence of C+l amino acid on epitope 
immunogenicity was found to be statistically significant using a chi-square test (PO.03). 
No significant influence on epitope immunogenicity was noted when similar analysis was 
performed for C-terminal residues more distal than the C+l position. 

20 

Direct Evaluation of the Effect of the CI Residue On Epitope Immunogenicity 

To directly evaluate the effect of preferred versus deleterious types of amino acids 
in the C+l flanking position, two multi-epitope constructs, referred to as HBV.l and 
HBV.2 (Fig 3b) were evaluated. As with HIV-FT, these HBV constructs encode the 
25 epitopes sequentially without intervening spacers. Indeed, the HBV.l and HBV.2 were 
generated by replacing the HIV- 1 epitopes in pMinl , an experimental multi-epitope 
construct previously characterized (Ishioka, supra) with similar HBV-derived epitopes. 

For HBV.l, the HIV-1 epitope directly following the highly immunogenic HBV 
Core 18 epitope was replaced with the HBV Pol 562 epitope. This altered the C+l residue 
30 from a K to an F. The second construct, HBV.2, was produced by the insertion of an 
additional epitope, HBV Pol 629, between the HBV Core 18 and Pol 562 epitopes; a 
change that replaced the C+l amino acid with a K residue. When the immunogenicity of 
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the Core 18 epitope presented in these different contexts was evaluated in HLA-A*0201/K b 
transgenic mice, it was determined that the Core 18 epitope was virtually non- 
immunogenic in HBV.l but strongly immunogenic in HBV.2 (Fig. 6a). The reduction of in 
vivo immunogenicity for this epitope was as would be predicted by our previous analysis. 
5 To further test the effects of the Ol flanking amino acid on CTL epitope 

immunogenicity, a set of constructs that differ from HBV.l by the insertion of single amino 
acids at the C+l position relative to the Core 18 epitope (Fig. 3b) was evaluated. Little or 
no CTL response was observed against the Core 1 8 epitope when flanked at the C+l 
position by W, Y, or L (Fig 6b). In contrast, insertion of a single K residue dramatically ; 

10 increased the CTL response to Core 18. The responses were comparable to those observed 
in HBV.2 in which the Core 18 epitope is flanked by Pol 629, an epitope with a K at the N- 
terminus of the epitope. Enhancement of the Core 18 CTL response was also observed to 
insertion of R, C, N, or G. The effect of these insertions is specific, as the immunogenicity 
of other epitopes within these constructs did not exhibit significant changes in CTL 

1 5 responses (data not shown). In conclusion, these data indicate that the C+l amino acid can 
dramatically influence epitope immunogenicity. 

Variations in CTL Epitope Immunogenicity Are Correlated With The Amount Presented 
If the variation of the immunogenicity of Core 1 8 associated with different C+l 

20 residues was the result of differential sensitivity to proteolytic cleavage then large 

differences in the levels of epitope presentation should be detectable in different constructs. 
To test this, Jurkat cells, expressing the same HLA-A*0201/K b gene expressed in the 
transgenic mice, were transfected with an episomal vector expressing either HBV.l or 
HBV. 1 K. The Core 1 8 epitope was presented at >1 0 5 higher levels when a K was in the 

25 C+l position, compared to the presence of an F in the same position (Fig. 7). It is unlikely 
that this difference in Core 18 presentation is attributed to differences in gene expression 
between target cell lines since presentation of Pol 455 varied by less than ten-fold. These 
data demonstrate the striking effect that amino acids at the C+l position can exert on 
efficiency of epitope presentation in multi-epitope DNA vaccines. Thus, these data show 

30 that the immunogenicity of CTL epitopes in a DNA vaccine can be optimized through 
design considerations that affect the level of epitope presentation. This type of 
optimization is applicable to epitope-based vaccines delivered using other formats, such as 
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viral vectors as well as other expression vectors known to those of skill in the art, since the 
effects are exerted after the antigen is transcribed and translated. 

In summary, for flanking residues, it was found that either very small residues such 
as A, C or G, or large residues such as Q, W, K, or R were generally associated with good 
5 or outstanding responses. The absence of a C+l residue because of a stop codon in the 
multi-epitope construct, or the presence of intermediate size residues such as S or T was 
associated with a more intermediate response pattern. Finally, in the case of a negatively 
charged residue, D; aliphatic (V, I, L, M) or aromatic-non tryptophan residues (Y, F), 
relatively poor responses were observed. These results show that the particular residue 
1 0 flanking the epitope's C-terminus can dramatically influence the response frequency and 
magnitude. Flanking residues at the C+l position can also be introduced in combination 
with spacer sequences. Thus, a residue that favors immunogenicity, preferably, K, R, N, A; 
or G, is included as a flanking residue of a spacer. 

15 Sorting and Optimization of Multi-epitope Constructs 

To develop multi-epitope constructs using the invention, the epitopes for inclusion 
in the multi-epitope construct are sorted and optimized using the parameters defined herein. 
Sorting and optimization can be performed using a computer or, for fewer numbers of 
epitopes, not using a computer. 

20 Computerized optimization can typically be performed as follows. The following 

provides an example of a computerized system that identifies and optimizes, e.g., provides 
for a minimal number of junctional epitopes and a maximal number of flanking residues, 
epitope combinations. Figure 10 illustrates a computer system 100 for performing the 
optimization of multi-epitope constructs, in accordance with one embodiment of the 

25 invention. The computer system 100 may be a conventional-type computer which includes 
processing circuitry, e.g., a central processing unit (CPU), memory, e.g., a hard disk drive 
(ROM), a random access memory (RAM), cache, and other components, devices and 
circuitry (not shown) typically found in computers today. In a preferred embodiment, the 
computer system 100 includes, among other components and devices, a Macintosh G3 333 

30 MHz processor, a six Gigabit (GB) hard drive, 96 Megabits of RAM, and 512 Kilabits 

(KB) of cache memory, capable of searching 600,000 to 700,000 permutations per second. 
The computer system 100 further includes a monitor 102 for displaying text, graphics and 
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1 other information to a user and a keyboard 1 04 for allowing a user to input data, 
commands, and other information to the computer system 100. 

As shown in Figure 10, in one embodiment, the computer system 100 may 
communicate with one or more remote computers 1 50 through a computer network 160 
5 such that registered users at remote locations can perform the junctional analyses and 
multi-epitope construct optimization procedures described herein by logging on at the 
remote computers 150 and supplying a required password or user identification. The 
computer network 160 may be a local area network (LAN), a wide area network (WAN), or 
the world-wide web (i.e., Internet). These types of networks are well-known in the art and, 
1 0 therefore, a discussion of these networks and their communication protocols is not provided 
herein. 

In a preferred embodiment, the computer system 100 stores a software program, 
e.g., object code, in the hard drive memory of the computer system 100. This object code 
is executed by the CPU for performing the functions described herein. One component, or 

1 5 module, of the software program carries out the function of analyzing and identifying 

junctional epitopes at the peptide junctions of the polypeptide encoded by a multi-epitope 
nucleic acid construct as well as evaluating combinations of spacer and flanking residues at 
these junctions. This software module is referred to herein as the "Junctional Analyzer" 
module or program. Ih a preferred embodiment, the Junctional Analyzer analyzes and 

20 processes peptides entered by a user in accordance with other criteria, data and operating 
parameters described below. 

Figures 1 1 A-B (hereinafter Figure 11) illustrate an exemplary input text file 200 
containing user input data and parameters which is used by the Junctional Analyzer ■ 
program, in accordance with one embodiment of the invention. As shown in Figure 1 1, 

25 various types of input data are provided to the program. First, a user may enter a set of 
peptides or epitopes 202 for processing. A set of weights 204 for each amino acid when it 
appears in a C+l and N-l position is also entered into the text file by the user. In one 
embodiment, the weight values are determined by statistical or empirical analysis of 
experimental results which reflect the immunogenicity or antigenicity "enhancement" 

30 effects of each amino acid when it is placed at the C+l or N-l positions of a polypeptide. 
However, the assignment of weight values for each amino acid may be performed by any 
number of methodologies, including in vitro and in vivo studies, which would be apparent 
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v to those of ordinary skill in the art, depending on the desired criteria used to determine the 
weight values. Some examples of such experiments or studies are described in further 
detail below. 

In a preferred embodiment, a database containing different epitope/flanking residue 

5 combinations is stratified on the basis of epitope immunogenicity and the number of 
optimal versus suboptimal responses are sorted to rank the amino acids and assign 
enhancement weight values. The text file also contains a set of motifs 206 to use in 
detecting junctional epitopes. In a preferred embodiment, the user may also enter a 
maximum number of amino acids (spacers and flanking) to insert between each pair of. 

10 peptides (Maxlnsertions) 208 to function as spacers and/or flanking residues. Other 

parameters, values or commands (collectively referred to herein as "parameters") to control 
the operation of the program may also be entered such as, for example: "OutputToScreen 
(Y/N)" 210; "OutputToFile (Y/N)" 212; the minimum function value to accept as a valid 
result ("MinimumAccepted") 214; the maximum number of results having the same 

15 function value ( 66 MaxDuplicateFunctionValue") 216; the maximum time allowed for a 
search in minutes ("SearchTime") 218; whether an Exhaustive Search is desired 
("Exhaustive = Y/N") 220; the number of Stochastic search probes 
(''NumStochasticProbes*') 222; the maximum number of hits allowed per single probe 
during a stochastic search CTVIaxHitsPerProbe") 224; and whether the start of each probe 

20 should be random or other C t RandomProbeStart(Y/N) > ') 226. These parameters are 

provided for purposes of illustration only. Other parameters to control the operation and 
output format of the program may be entered as would be obvious to those of ordinary skill 
in the art. 

The motifs 206 in the text file 200 provide a "mask" or structural model for 
25 identifying junctional epitopes. For example the first motif 206a shown in Figure 11, 

XXXX(FY)XX(LIMV), defines an epitope that is eight amino acids in length. The value 
"X" indicates that any amino acid may be at that position of the epitope. The value "(FY)" 
indicates that either an F amino acid or a Y amino acid may be in the fifth position of the 
epitope. Similarly, "(LIMV)" indicates that any one of the listed amino acids, L, I, M or V, 
30 may be in the eighth position of the epitope. Therefore if a sequence of eight amino acids 
spanning a junction of two peptides satisfies the above motif criteria, it is identified as a 
junctional epitope. 

45 



WO 02/083714 



PCT/US02/09877 



Figure 12 illustrates a flow chart diagram of one embodiment of the Junctional 
Analyzer program. At step 301, the program receives user inputs and instructions for 
performing the junctional analysis operation. In a preferred embodiment, the program uses 
an input text file 200 as shown in Figure 1 1 to input parameters 202-226. As is well-known 
5 in the art, such a text file may be derived, for example, from a Microsoft Excel™ 

spreadsheet file or document, to specify desired input parameters (e.g., epitopes, motifs, 
flanking residue weight values, maximum number of hits, maximum search time, etc.) for 
its operation. At step 303, the Junctional Analyzer program generates a list of all epitope 
pairs. For example, if ten epitopes are entered by the user, there will be a total of ninety 

10 (10x9) epitope (peptide) pairs. Next, at step 305, for each pair of peptides or epitopes, the 
program determines the set of insertions that results in the minimum number of junctional 
epitopes and/or the maximum effect from the C+l and N-l contribution of spacing 
residues. To make this determination, the program calculates a function value for each 
possible combination of spacers for each peptide pair, where the number of spacers can 

15 range from 0 to Maxlnsertions 208 (Fig. 1 1) and any arrangement of known or prespecified 
amino acids may be considered. In a preferred embodiment, the function value is 
calculated using the following equation: F= (C + N)/J, where C is the enhancement weight 
value for a flanking amino acid located at the C+l position of an epitope, N is the 
enhancement weight value for a flanking amino acid located at the N-l position of an 

20 epitope, and J is the number of junctional epitopes present. Since multiple motifs may be 
satisfied at one junction of a peptide pair, J may be a number greater than one. When J=0, 
F = 2(C+N). This second equation was chosen because for a fixed value of (C+N), the 
function value F will double when J changes from two to one, and will double again when J 
changes from one to zero. It is understood, however, that the above equations are 

25 exemplary only and that other equations for evaluating peptide pairs can be easily added to 
the program at any time. Modifications or changes to the above equations, depending on 
the desired criteria for emphasis or evaluation, would be readily apparent to those of 
ordinary skill in the art. At step 307, the program outputs the optimum combination of 
insertions (spacing and/or flanking residues) for each pair of peptides and the maximum 

30 function value for each pair of peptides. In a preferred embodiment, at step 307, the output 
from this program is generated as an output text file that fists, for each pair of peptides, the 
insertion that yields the maximum function result. 

46 



WO 02/083714 



PCT/US02/09877 



Figures 13A-D (hereinafter Figure 13) illustrate an exemplary output text file 400 
that lists, for each peptide pair, the spacer combination having the maximum function 
value. In the example shown in Figure 13, eleven peptides, labeled A-K 202 (Fig. 1 1), 
were processed, the Motifs 206 were used to detect junctional epitopes, the enhancement 
5 weight values for each potential flanking residue 204 were used, and Maxlnsertions 208 
was set to four. Other parameters for controlling the operation and format of the Junctional 
Analyzer program were set as illustrated by the parameter settings 402. For purposes of 
convenience, in a preferred embodiment, these input parameters are repeated in the output 
text file 400. The output text file 400 includes an output table 404 which contain the results 

10 of steps 305 (Fig. 12). The first column (Col. 1) of the output table 404 indicates the first 
peptide of a pair. The second column (Col. 2) of the output table lists the first amino acid 
insertions which function both as a spacer and the C+l flanking amino acid. The third 
column lists a second spacer amino acid. The fourth column lists a third spacer amino 1 acid. 
The fifth column lists a fourth spacer amino acid which is also the N-l flanking amino acid 

15 for the second peptide of the pair which is listed in column six. The seventh column lists 
the enhancement weight value of the C+l flanking amino acid listed in column two. The 
eighth column lists the enhancement weight value of the N-l flanking amino acid listed in 
column six 412. The ninth column lists the sum of the C+l and N-l enhancement weight 
values. The tenth column lists the number of junctional epitopes found in the peptide pair 

20 and the eleventh column lists the maximum function value for the peptide pair based on the 
equations listed above. For example, the first row of the output table 404 shows that for the 
peptide pair A-B, corresponding to the peptides VLAEAMSQV - ILKEPVHGV, the spacer 
combination of three amino acids, CAL, eliminates all junctional epitopes and provides a 
maximum function value of 8.80. It is understood, however, that other output options may 

25 be implemented in accordance with the invention. For example, the output table 404 may 
show the top 32 results for each pair of peptides, or show every result for all possible 
insertions in the order evaluated, or trace the motif search process to generate large output 
files, depending on the level of detail and/or analysis desired by the user. 

In a preferred embodiment, the information contained in the output table 404 is used 

30 to perform either an "Exhaustive J Search" or a "Stochastic J Search" to identify a 

polypeptide construct linking all eleven peptides, including optimum spacer combinations. 
For eleven peptides, for example, there will be ten junctions. Therefore the permutation 
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. which yields the largest sum of function values taking into account all ten junctions is 
identified as the "optimum" permutation(s) of the multi-epitope constructs. In one 
embodiment, for the convenience of the user, the output text file 400 will also contain the 
original list of peptides/epitopes 202, the weight values used 204, the motifs used 206, and 
5 Maxlnsertion value 208, and other parameter settings 402 entered into the input text file 
200 of Figure 11. 

The "Exhaustive J Search" looks at all permutations of the peptides and selects the 
ones that have the largest function sum. However, due to the factorial nature of 
permutations, as the number of peptides to be processed increases, the time required to 

10 complete an Exhaustive J Search increases almost exponentially. For example, using a 
standard Macintosh 333 MHz computer, the estimated running time for 13 peptides is 
approximately 2.9 hours and would be approximately 40 hours for 14 peptides. The 
"Stochastic J Search" is designed to search many areas of the permutation sequence, rather 
than the entire permutation space, and report the best function sum that it finds. By 

1 5 reporting only permutations that meet or exceed the current maximum function total, it is 
possible to search a much broader area of the permutation sequence. This technique has 
been successful with as many as 20 peptides. The time to perform an exhaustive search of 
20 peptides is estimated to be on the order of 1 .3 x 105 years. 

Referring again to Figure 12, at step 309, the program determines whether to 

20 perform an Exhaustive or Stochastic search of the possible permutations of polypeptides 
from the output text file 400. In a preferred embodiment, the determination at step 309 is 
made by the user who inputs whether the search will be Exhaustive or Stochastic as 
indicated by the input parameter, Exhaustive (Y /N) 220 (Fig. 11). In other embodiments, 
the program may automatically select either a Stochastic or Exhaustive search depending 

25 on the number of peptides to be processed. For example, if less than 14 epitopes are to be 
included, an Exhaustive search routine is automatically selected by the program. The 
Exhaustive search program examines all permutations of the epitopes making up the multi- 
epitope construct to find the one(s) with the best value for the sum of the optimizing 
function for all pairs of epitopes. This is guaranteed to find the "best" permutation(s) since 

30 all are examined. If 14 or more epitopes are to be included in the multi-epitope construct, a 
Stochastic search is used. In a preferred embodiment, the Stochastic search uses a Monte 
Carlo technique, known to those of skill in the art, to examine many regions of the 
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permutation space to find the best estimate of the optimum arrangement of the peptides. 
However, other methods of Stochastic searching may be implemented in accordance with 
the invention. For example, rather than randomly picking a starting permutation for each 
stochastic probe, the program may require that each probe begin with a permutation 
5 beginning with a different one of the peptides entered by the user. For example, if there 
were just three peptides, A, B and C, the three probes would begin with, for example, ABC, 
BAC and CBA. This method provides a fairly uniform coverage of the possible 
permutations. 

If a Stochastic search has been selected, next, at step 3 1 1 , the program begins the 

10 Stochastic search by initiating a probe. Next, at step 3 13, the program determines if the 
maximum search time per probe has been exceeded. If the maximum search time has not 
been reached, next, at step 315, the program determines whether a single probe has reached 
or exceeded the maximum number of "hits" per probe. In one embodiment, a probe hit is 
registered when a permutation's function value sum is the same as or greater than the 

1 5 largest function sum previously registered for one or more previously analyzed 

permutations. If the maximum number of hits per probe has not been reached, then, at step 
317, the current stochastic probe evaluates the next permutation or set of permutations and 
the process returns step 313. If at step 3 15 it is determined that the maximum number of 
hits per probe has been reached or exceeded, then, the program proceeds to step 319, where 

20 the program determines whether a maximum number of probes have already been 

executed. Also, if at step 313, it is determined that the maximum time limit per probe has 
been reached or exceeded, the program proceeds to step 319 to determine if the maximum 
number of probes have been completed. I£ at step 319, it is determined that the maximum 
number of probes has not been reached, the program returns to step 311 and a new search 

25 probe is initiated. If at step 3 1 9 it is determined that the maximum number of probes have 
been executed, the program then proceeds to step 323 where it outputs the best set of 
optimum permutations identified up to that point. This "best set" may consist of only those 
permutations having the highest function sum or, alternatively, may consist of the 
permutations having the top three highest function sums, for example, or any other output 

30 criteria desired by the user. 

In one preferred embodiment, if a probe has received a maximum number of hits 
specified per probe, any unused time for that probe is divided by the remaining probes to 
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decide how much time should be allocated to each of the remaining probes. In other words, 
if a probe terminates early because of finding too many hits then the remaining probes are 
allocated more time. Such functionality is easily implemented by those of ordinary skill in 
the computer programming arts. 
5 If at step 309, an Exhaustive search has been selected, then, at step 32 1 , an 

exhaustive search is initiated which analyzes every permutation, as described above. At the 
completion of the Exhaustive analysis, the program proceeds to step 323 where it outputs 
the "best set" of optimum permutations found. As mentioned above, this "best set" may 
include those permutations with the highest sum function values, or the top three highest 

10 - sum function values, or permutations meeting any desired criteria specified by the user 
(e.g., top 30 permutations with the highest function values). 

For each of the decision steps or determination steps discussed above (e.g., steps 
313, 315 and 319), the program may be set to perform a query at periodic intervals (e.g., 
every five seconds) or, alternatively, the program may be set to perform a query after a 

15 specified number of permutations (e.g., five) have been analyzed or after every permutation 
has been analyzed. Any one of these operation and timing protocols is easily implemented 
and adjusted by those of ordinary skill in the art. 

The Program output provides a list of the best arrangements of the epitopes. Since 
many permutations may have the same value of the evaluation function, several are 

20 generated so that other factors can be considered in choosing the optimum arrangement. 
Examples of multi-epitope constructs generated using the above-described computerized 
techniques are illustrated in Figure 9. An exemplary process flow implemented by the 
method and system of the invention is provided above. As would be readily apparent to 
those of ordinary skill, other factors such as charge distribution, hydrophobic/hydrophilic 

25 region analysis, or folding prediction could also be incorporated into the evaluation 

function to further optimize the multi-epitope constructs. In addition, the multi-epitope 
construct may be further optimized by processing a multi-epitope construct already 
optimized by the process through the same or similar process one or more additional times. 
In the subsequent rounds of processing one or more parameters may be modified as 

30 compared to the parameters used in the first round of optimization. An example of a multi 
-epitope construct that was optimized in two rounds is the HBV-30CL construct. 
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Multi-epitope constructs can also be optimized by determining the structure of each 
construct to be considered. Macromolecular structures such as polypeptide structures can 
be described in terms of various levels of organization. For a general discussion of this 
organization, see, e.g., Alberts et ai, Molecular Biology of the Cell (3 rd ed., 1994) and 
5 Cantor and Schimmel, Biophysical Chemistry Part I: The Conformation of Biological 
Macromolecules (1980). "Primary structure" refers to the amino acid sequence of a 
particular peptide. "Secondary structure" refers to locally ordered, three dimensional 
structures within a polypeptide. These structures are commonly known as domains. 
Domains are portions of a polypeptide that form a compact unit of the polypeptide. Typical 

1 0 domains are made up of sections of lesser organization such as stretches of P-sheet and. a- 
helices. "Tertiary structure" refers to the complete three' dimensional structure of a 
polypeptide monomer. "Quaternary structure" refers to the three dimensional structure 
formed by the noncovalent association of independent tertiary units. 

Structural predictions such as charge distribution, hydrophobic/hydrophilic region 

1 5 analysis, or folding predictions can be performed using sequence analysis programs known 
to those of skill in the art, for example, hydrophobic and hydrophilic domains can be 
identified (see t e.g., Kyte & Doolittle, J. Mol. Biol 157:105-132 (1982) and Stryer, 
Biochemistry (3 rd ed. 1988); see also any of a number of Internet based sequence analysis 
programs, such as those found at dot.imgen.bcm.tmc.edu. 

20 A three-dimensional structural model of a multi-epitope construct can also be 

generated. This is generally performed by entering amino acid sequence to be analyzed 
into the computer system. The amino acid sequence represents the primary sequence or 
subsequence of the protein, which encodes the structural information of the protein. The 
three-dimensional structural model of the protein is then generated by the interaction of the 

25 computer system, using software known to those of skill in the art. 

The amino acid sequence represents a primary structure that encodes the 
information necessary to form the secondary, tertiary and quaternary structure of the 
protein of interest. The software looks at certain parameters encoded by the primary 
sequence to generate the structural model. These parameters are referred to as "energy 

30 terms," and primarily include electrostatic potentials, hydrophobic potentials, solvent 

accessible surfaces, and hydrogen bonding. Secondary energy terms include van der Waals 
potentials. Biological molecules form the structures that minimize the energy terms in a 
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cumulative fashion. The computer program is therefore using these terms encoded by the 
primary structure or amino acid sequence to create the secondary structural model The 
tertiary structure of the protein encoded by the secondary structure is then formed on the 
basis of the energy terms of the secondary structure. The user can enter additional 
5 variables such as whether the protein is membrane bound or soluble, its location in the 
body, and its cellular location, e.g., cytoplasmic, surface, or nuclear. These variables along 
with the energy terms of the secondary structure are used to form the model of the tertiary 
structure. In modeling the tertiary structure, the computer program matches hydrophobic 
faces of secondary structure with like, and hydiuphilic faces of secondary structure with 
10 like. Those multi-epitope constructs that are most readily accessible to the HLA processing 
apparatus are then selected. 

Assessment Of Immunogenicitv Of Multi-epitope Vaccines 

The development of multi-epitope constructs represents a unique challenge, because 

15 the species-specificity of the peptide binding to MHC. Different MHC types from different 
species tend to bind different sets of peptides (Rammensee et al., Immunogenetics, 
Vol. 41(4):178-228 (1995)). As a result, it is not possible to test in regular laboratory 
animals a construct composed of human epitopes. Alternatives to overcome this limitation 
are generally available. They include: 1) testing analogous constructs incorporating 

20 epitopes restricted by non-human MHC; 2) reliance on control epitopes restricted by non 
human MHC; 3) reliance on crossreactivity between human and non-human MHC; 4) the 
use of HLA transgenic animals; and 5) antigenicity assays utilizing human cells in vivo. 
The following is a brief overview of the development of the technology for analyzing 
antigenicity and immunogenicity. 

25 

Class I HLA Transgenics 

The utility of HLA transgenic mice for the purpose of epitope identification (Sette 
et al., J Immunol, Vol. 153(12):5586-92 (1994); Wentworth et al., Int Immunol, Vol. 
8(5):651-9 (1996); Engelhard et al.,JImmunol, Vol. 146(4): 1226-32 (1991); Manet aL,/nf 
30 Immunol, Vol. 7(4):597-605 (1995); Shirai et al., J Immunol, Vol. 154(6):2733-42 (1995)), 
and vaccine development (Ishioka et al., J Immunol, Vol. 162(7):3915-25 (1999)) has been 
established. Most of the published reports have investigated the use of HLA A2. 1/K b mice 
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but it should be noted that B*27, and B*3501 mice are also available. Furthermore, HLA 
A*ll/K b mice (Alexander tit bL, J Immunol. Vol. 159(10):4753-61 (1997)), and HLA 
B7/K b and HLA Al/K b mice have also been generated. 

Data from 38 different potential epitopes was analyzed to determine the level of 
5 overlap between the A2.1 -restricted CTL repertoire of A2.1/K b -transgenic mice and A2.1+ 
humans (Wentworth et al., Eur J Immunol, Vol. 26(1):97-101 (1996)). In both humans and 
mice, an MHC peptide binding affinity threshold of approximately 500 nM correlates with ■ 
the capacity of a peptide to elicit a CTL response in vivo. A high level of concordance 
between the human data in vivo and mouse data in vivo was observed for 85% of the high- 

10 binding peptides, 58% of the intermediate binders, and 83% of the low/negative binders. ; 
Similar results were also obtained with HLA Al 1 and HLA B7 transgenic mice (Alexander 
et al, J Immunol Vol. 159(10):4753-61 (1997)). Thus, because of the extensive overlap 
that exists between T cell receptor repertoires of HLA transgenic mouse and human CTLs, 
transgenic mice are valuable for assessing immunogenicity of the multi-epitope constructs 

15 described herein. 

The different specificities of TAP transport as it relates to HLA Al 1 mice does not 
prevent the use of HLA-A1 1 transgenic mice of evaluation of immunogenicity. While both 
murine and human TAP efficiently transport peptides with an hydrophobic end, only 
human TAP has been reported to efficiently transport peptides with positively charged C 

20 terminal ends, such as the ones bound by A3, Al 1 and other members of the A3 supertype. 
This concern does not apply to A2, Al or B7 because both murine and human TAP should 
be equally capable of transporting peptides bound by A2, B7 or Al . Consistent with this 
understanding, Vitiello (Vitiello et al., J Exp Med, Vol. 173(4): 1007-1 5 (1991)) and 
Rotzschke (Rotzschke O, Falk K., Curr Opin Immunol, Vol. 6(1):45-51 (1994)) suggested 

25 that processing is similar in mouse and human cells, while Cerundolo (Rotzschke O, Falk 
K., Curr Opin Immunol, Vol. 6(1):45-51 (1994)) suggested differences in murine versus 
human cells, both expressing HLA A3 molecules. However, using HLA Al 1 transgenics, 
expression of HLA molecules on T and B cells in vivo has been observed, suggesting that 
the reported unfavorable specificity of murine TAP did not prevent stabilization and 

30 transport of Al 1/K b molecules in vivo (Alexander et al., J Immunol Vol. 159(10):4753-61 
(1997)). These data are in agreement with the previous observation that peptides with a 
charged C termini could be eluted from murine cells transfected with Al 1 molecules 
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(Maier et al., Immunogenetics; Vol. 40(4):306-8 (1994)). Responses in HLA Al 1 mice to 
complex antigens, such as influenza, and most importantly to Al 1 restricted epitopes 
encoded by multi-epitope constructs (Ishioka et al., J Immunol, Vol. 162(7):3915-25 
(1999)) has also been detected. Thus, the TAP issue appears to be of minor concern with 

. 5 transgenic mice. 

Another issue of potential relevance in the use of HLA transgenic mice is the 
possible influence of f$2 microglobulin on HLA expression and binding specificity. It is 
well known that human p2 binds both human and mouse MHC with higher affinity and 
stability than mouse P2 microglobulin (Shields et al., Mol Immunol Vol. 35(14-15):919-28 

10 (1998)). It is also well known that more stable complexes of MHC heavy chain and p2 

facilitate exogenous loading of MHC Class I (Vitiello et al., Science, Vol. 250(4986): 1423- 
6 (1990)). We have examined the potential effect of this variable by generating mice that 
are double transgenics for HLA/K b and human p2. Expression of human p2 was beneficial 
in the case HLA B7/K b mice, and was absolutely essential to achieve good expression ' 

15 levels in the case of HLA Al transgenic mice. Accordingly, HLA/K b and p2 double 
transgenic mice are currently and routinely bred and utilized by the present inventors. 
Thus, HLA transgenic mice can be used to model HLA-restricted recognition of four major 
HLA specificities (namely A2, Al 1, B7 and Al) and transgenic mice for other HLA 
specificities can be developed as suitable models for evaluation of immunogenicity. 

20 

Antigenicity testing for Class I epitopes 

Several independent lines of experimentation indicate that the density of Class 

I/peptide complexes on the cell surface may correlate with the level of T cell priming. 

Thus, measuring the levels at which an epitope is generated and presented on an APC's 
25 surface provides an avenue to indirectly evaluate the potency of multi-epitope nucleic acid 

vaccines in human cells in vitro. As a complement to the use of HLA Class I transgenic 

mice, this approach has the advantage of examining processing in human cells. (Ishioka et 

al., J Immunol Vol. 162(7):3915-25 (1999)) 

Several possible approaches to experimentally quantitate processed peptides are 
30 available. The amount of peptide on the cell surface can be quantitated by measuring the 

amount of peptide eluted from the APC surface (Sijts et al., J Immunol, Vol. 156(2):683-92 

(1996); Demote et al., Nature, Vol. 342(6250):682-4 (1989)). Alternatively, the number of 
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peptide-MHC complexes can be estimated by measuring the amount of lysis or lymphokine 
release induced by infected or transfected target cells, and then determining the 
concentration of peptide necessary to obtain equivalent levels of lysis or lymphokine 
release (Kageyama et aL, J Immunol, Vol. 154(2):567-76 (1995)). 
5 A similar approach has also been used to measure epitope presentation in multi- 

epitope nucleic acid-transfected cell lines. Specifically, multi-epitope constructs that are 
immunogenic in HLA transgenic mice are also processed into optimal epitopes by human 
cells transfected with the same constructs, and the magnitude of the response observed in 
transgenic mice correlates with the antigenicity observed with the transfected human target 

10 cells (Ishioka et al., J Immunol, Vol. 162(7):3915-25 (1999)). 

Using antigenicity assays, a number of related constructs differing in epitope order • 
or flanking residues can be transfected into APCs, and the impact of the aforementioned 
variables on epitope presentation can be evaluated. This can be a preferred system for 
testing where a relatively large number of different constructs need to be evaluated. 

15 Although it requires large numbers of epitope-specific CTLs, protocols that allow for the 
generation of highly sensitive CTL lines (Alexander-Miller et al., Proc Natl Acad Sci U S 
A, Vol. 93(9):4102-7 (1996)) and also for their expansion to large numbers (Greenberg 
P.D., Riddell S.R., Science, Vol. 285(5427):546-51 (1999)) have been developed to address 
this potential problem. 

20 It should also be kept in mind that, if the cell selected for the transfection is not 

reflective of the cell performing APC function in vivo, misleading results could be 
obtained. Cells of the B cell lineage, which are known "professional" APCs, are typically 
employed as transfection recipients. The use of transfected B cells of this type is an 
accepted practice in the field. Furthermore, a good correlation has already been noted 

25 between in vitro data utilizing transfected human B cells and in vivo results utilizing HLA 
transgenic mice, as described in more detail herein. 



Measuring HTL responses 

In preferred embodiments, vaccine constructs are optimized to induce Class II 
30 restricted immune responses. One method of evaluating multi-epitope constructs including 
Class II epitopes, is to use HLA-DR transgenic mice. Several groups have produced and 
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characterized HLA-DR transgenic mice (Taneja V., David C.S., Immunol Rev, Vol. 169:67- 
79 (1999)). 

An alternative also exists which relies on crossreactivity between certain human 
MHC molecules and particular MHC molecules expressed by laboratory animals. Bertoni 
5 and colleagues (Bertoni et al., J Immunol Vol. 161(8):4447-55 (1998)) have noted that 
appreciable crossreactivity can be demonstrated between certain HLA Class I supertypes 
and certain PATR molecules expressed by chimpanzees. Crossreactivity between human 
and macaques at the level of Class II (Geluk et al., J Exp Med, Vol. 177(4):979-87 (1993)) 
and Class I molecules (Dzuris, et al, J. Immunol. , July 1999) has also been noted. Finally, 

10 it can also be noted that the motif recognized by human HLA B7 supertype is essentially 
the same as the one recognized by the murine Class I L d (Rammensee et al., 
Immunogenetics, Vol. 41(4):178-228 (1995)). Of relevance to testing HLA DR restricted 
epitopes in mice, it has been shown by Wall, et al (Wall et al., 1 Immunol, 152:4526-36 
(1994)) that similarities exist in the motif of DR1 and IA b . We routinely breed our 

15 transgenic mice to take advantage of this fortuitous similarity. Furthermore, we have also 
shown that most of our peptides bind to IA b , so that we use these mice for the study of CTL 
and HTL immunogenicity. 

Measuring and Ouantitating Immune Responses from Clinical Samples 
20 A crucial element to assess vaccine performance is to evaluate its capacity to induce 

immune responses in vivo. Analyses of CTL and HTL responses against the immunogen, 
as well as against common recall antigens are commonly used and are known in the art. 
Assays employed included chromium release, lymphokine secretion and 
lymphoproliferation assays. 
25 More sensitive techniques such as the ELISPOT assay, intracellular cytokine 

staining, and tetramer staining have become available in the art. It is estimated that these 
newer methods are 10- to 100-fold more sensitive than the common CTL and HTL assays ■■ 
(MuraU-Krishna et al., Immunity, Vol. 8(2): 177-87 (1998)), because the traditional methods 
measure only the subset of T cells that can proliferate in vitro, and may, in fact, be 
30 representative of only a fraction of the memory T cell compartment (Ogg G.S., McMichael 
AJ., Curr Opin Immunol Vol. 10(4):393-6 (1998)). Specifically in the case of HTV, these 
techniques have been used to measure antigen-specific CTL responses from patients that 
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would have been undetectable with previous techniques (Ogg et al., Science, Vol. 
279(5359):2103-6 (1998); Gray et al., J Immunol, Vol. 162(3):1780-8 (1999); Ogg et al., J 
Virol Vol. 73(1 1):9153-60 (1999); Kalams et al., J Virol Vol. 73(8):,6721-8 (1999); 
Larsson et al., AIDS, Vol. 13(7):767-77 (1999); Corne et al., JAcquir Immune Defic Syndr 
5 Hum Retroviral Vol. 20(5):442-7 (1999)). 

With relatively few exceptions, direct activity of freshly isolated cells has been 
difficult to demonstrate by the means of traditional assays (Ogg G.S., McMichael A.J., 
Curr Opin Immunol, Vol. 10(4):393-6 (1998)). However, the increased sensitivity of the . 
newer techniques has allowed investigators to detect responses from cells freshly isolated 

10 from infected humans or experimental animals (Murali-Krishna et al., Immunity, Vol. 
8(2):177-87 (1998); Ogg G.S., McMichael A.J., Curr Opin Immunol Vol. 10(4):393-6 
(1998)). The availability of these sensitive assays, that do not depend on an in vitro 
restimulation step, has greatly facilitated the study of CTL function in natural infection and 
cancer. In contrast, assays utilized as an endpoint to judge effectiveness of experimental 

15 vaccines are usually performed in conjunction with one or more in vitro restimulation steps 
(Ogg G.S., McMichael A.J., Curr Opinlmmunol Vol. 10(4):393-6 (1998)). In fact, with 
few exceptions (Hanke et al., Vaccine, Vol. 16(4):426-35 (1998)), freshly isolated Class I- 
restricted CD8+ T cells have been difficult to demonstrate in response to immunization 
with experimental vaccines designed to elicit CTL responses. The use of sensitive assays, 

20 such as ELISPOT or in situ EETSfy ELISA, have been combined with a restimulation step to ' 
achieve maximum sensitivity; MHC tetramers are also used for this purpose. 

MHC tetramers were first described in 1996 by AJtman and colleagues. They 
produced soluble HLA-A2 Class I molecules which were folded with HTV-specific 
peptides containing a CTL epitope complexed together into tetramers tagged with 

25 fluorescent markers. These are used to label populations of T cells from HIV-infected 
individuals that recognize the epitope (Ogg G.S., McMichael A.J., Curr Opin Immunol, 
Vol. 10(4):393-6 (1998)). These cells were then quantified by flow cytometry, providing a 
frequency measurement for the T cells that are specific for the epitope. This technique has 
become very popular in HIV research as well as in other infectious diseases (Ogg G.S., 

30 McMichael A.J., Curr Opin Immunol, Vol. 10(4):393-6 (1998); Ogg et al., Science, Vol. 
279(5359):2103-6 (1998); Gray et al., J Immunol, Vol. 162(3):1780-8 (1999); Ogget al.,/ 
Virol Vol. 73(ll):9153-60 (1999); Kalams et al., J Virol, Vol. 73(8):6721-8 (1999)). 
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However, HLA polymorphism can limit the general applicability of this technique, in that 
the tetramer technology relies on defined HLA/peptide combinations. However, it has been 
shown that a variety of peptides, including HIV-derived peptides, are recognized by 
peptide-specific CTL lines in the context of different members of the A2, A3 and B7 
5 supertypes (Threlkeld et al., J Immunol Vol. 159(4): 1648-57 (1997); Bertoni et sL t JCIin 
Invest, Vol. 100(3):503-13 (1997)). Taken together these observations demonstrate that a T 
cell receptor (TCR) for a given MHC/peptide combination can have detectable affinity for 
the same peptide presented by a different MHC molecule from the same supertype. 

In circumstances in which efficacy of a prophylactic vaccine is primarily correlated 

1 0 with the induction of a long-lasting memory response, restirnulation assays can be the most 
appropriate and sensitive measures to monitor vaccine-induced immunological responses. 
Conversely, in the case of therapeutic vaccines, the main immunological correlate of 
activity can be the induction of effector T cell function, most aptly measured by primary 
assays. Thus, the use of sensitive assays allows for the most appropriate testing strategy for 

1 5 immunological monitoring of vaccine efficacy. 

Antigenicity of Multi-epitope Constructs in Transfected Human APC's 

Antigenicity assays are performed to evaluate epitope processing and presentation 
in human cells. An episomal vector to efficiently transfect human target cells with multi- 

20 epitope nucleic acid vaccines is used to perform such an analysis. 

For example, 221 A2K b target cells were transfected with an HTV-1 EpiGene™ 
vaccine. The 221 A2K b target cell expresses the A2K b gene that is expressed in HLA 
transgenic mice, but expresses no endogenous Class I (Shimizu Y, DeMars R, J Immunol, 
Vol. 142(9):3320-8 (1989)). These transfected cells were assayed for their capacity to 

25 present antigen to CTL lines derived from HLA transgenic mice and specific for various 
HIV-derived CTL epitopes. To correct for differences in antigen sensitivity of different 
CTL lines, peptide dose titrations, using untransfected cells as APC, were run in parallel. 
Representative data is presented in Fig. 8. In the case of HTV Pol 498-specific CTL, the 
transfected target cells induced the release of 378 pg/ml of IFNy. Inspection of the peptide 

30 dose responses reveals that, 48 ng/ml of exogenously added peptide was necessary to 
achieve similar levels of IFNy release. These results demonstrate that relatively large 
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amounts of Pol 498 epitope are presented by the transfected cells, equivalent to 48 ng/ml of 
exogenously added peptide. 



Table 5. ComDarison between antiaenicitv in transtected human cells and immunoaenicitv in HLA 


transgenic mice of the HIV-1 nrdniqene 








Epitope 


Antigenicity 
Peptide Equivalents 11 


n 2 > 


Immunogenicity 
% response 31 Magnitude 41 


HIV Pol 498 


30.5 


(6) 


95% 


46.7 


HIVEnv134 


6.2 


(3) 


62% 


16.1 


HIVNef221 


2.1 


(5) 


82% 


3.8 


HIV Gag 271 


<0.2 


(6) 


31% 


4 


1) ng/ml; 2) number of independent experiments; 3) % of CTL cultures yielding positive results; 4) Lytic Units 



By comparison, less than 25 pg/ml BFNy was detected utilizing the CTL specific for 
the Gag 271 epitope. The control peptide titration with untransfected target cells revealed 
5 that this negative result could not be ascribed to poor sensitivity of the particular CTL line 
utilized, because as little as 0.2 pg/ml of "peptide equivalents" (PE) could be detected. 
Thus, it appears that the Gag 271 epitope is not efficiently processed and presented in the 
HIV-1 transfected target cells. Utilizing the "peptide equivalents" figure as an approximate 
quantitation of the efficiency of processing, it can be estimated that at least 200-fold less 

1 0 Gag 27 1 is presented by the transfected targets, compared to the Pol 498 epitope. 

The results of various independent determinations for four different epitopes 
contained within HTV-FT are compiled in Table 5. The amount of each epitope produced 
from the HTV-FT transfected cells ranged from 30.5 PE for Pol 498, to a low of less than 
0.2 PE for Gag 271. The two epitopes Env 134 and Nef 221 were associated with 

15 intermediate values, of 6.1 and 2.1 PE, respectively. 

These results were next correlated with the in vivo immunogenicity values observed 
for each epitope after immunization with the HTV-FT construct. The Pol 498 epitope was 
also the most immunogenic, as would be predicted. The Env 134 and Nef 221 epitopes, for 
which intermediate immunogenicity was observed in vivo, were also processed in vitro 

20 with intermediate efficiency by the transfected human cells. Finally, the Gag 271, for 
which no detectable in vitro processing was observed was also associated with in vivo 
immunogenicity suboptimal in both frequency and magnitude. 
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These data have several important implications. First, they suggest that different 
epitopes contained within a given construct may be processed and presented with 
differential efficiency. Second, they suggest that immunogenicity is proportional to the 
amount of processed epitope generated. Finally, these results provide an important 
5 validation of the use of transgenic mice for the purpose of optimization of multi-epitope 
vaccines destined for human use. 

IIL Preparation of Multi-epitope Constructs 

Epitopes for inclusion in the multi-epitope constructs typically bear HLA Class I or 
10 Class II binding motifs as described, for example, in PCT applications PCT7US00/27766, ' 
or PCT/US00/19774. Multi-epitope constructs can be prepared according to the methods 
set forth in Ishioka, et aL. J. Immunol (1999) 162(7):3915-3925, for example. 

Multiple HLA class II or class I epitopes present in a multi-epitope construct can be 
derived from the same antigen, or from different antigens. For example, a multi-epitope 
15 construct can contain one or more HLA epitopes that can be derived from two different 
antigens of the same virus or from two different antigens of different viruses. Epitopes for 
inclusion in a multi-epitope construct can be selected by one of skill in the art, e.g., by 
using a computer to select epitopes that contain HLA allele-specific motifs or supermotifs. 
The multi-epitope constructs of the invention can also encode one or more broadly cross- 
20 reactive binding, or universal, HLA class II epitopes, eg, PADRE® (Epimmuhe, San 
Diego, CA), (described, for example, in U.S. Patent Number 5,736,142) or a PADRE® 
family molecule. 

Universal HLA Class II epitopes can be advantageously combined with other HLA • 
Class I and Class II epitopes to increase the number of cells that are activated in response to 
25 a given antigen and provide broader population coverage of HLA-reactive alleles. Thus, 
the multi-epitope constructs of the invention can include HLA epitopes specific for an 
antigen, universal HLA class II epitopes, or a combination of specific HLA epitopes and at 
least one universal HLA class II epitope. 

HLA Class I epitopes are generally about 8 to about 13 amino acids in length, in 
30 particular 8, 9, 10, or 1 1 amino acids in length. HLA Class II epitopes are generally about 
6 to 25 amino acids in length, in particular about 13 to 21 amino acids in length. . An HLA 
Class I or II epitope can be derived from any desired antigen of interest. The antigen of 
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interest can be a viral antigen, surface receptor, tumor antigen, oncogene, enzyme, or any 
pathogen, cell or molecule for which an immune response is desired. Epitopes can be 
selected based on their ability to bind one or multiple HLA alleles. Epitopes that are 
analogs of naturally occuring sequences can also be included in the multi-epitope 
5 constructs described herein. Such analog peptides are described, for example, in PCT 
applications PCTYUS97/03778, PCT7US00/19774, and co-pending U.S.S.N. 09/260,714 
filed 3/1/99. 

Given the methods described herein for optimizing epitope configuration and 
spacers between the epitopes, the skilled artisan may include any HLA epitopes into the 

1 0 multi-epitope constructs described herein. Figures 2, 3, 9, 17, and 18A-1 8N depict 
exemplary constructs, where epitopes in the constructs are listed in Figures 19A-19E. 
Exemplary constructs are also set forth in Figures 20B, 20D, 20E, and 20F (epitopes are , 
listed in Figure 20A); Figures 21B, 21D, and 21E (epitopes are listed in Figure 21 A); 
Figures 22B, 22D, and 22E (epitopes are listed in 22A); Figure 23C; and Figures 24B and 

15 24C (epitopes are listed in Figure 24A). Multi-epitope constructs may include five or 
more, or six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, twenty, 
twenty-five, or thirty or more of the epitopes set forth in Figures 19A-19E, 20A, 21 A, 22A, 
and 24 A. Multi-epitope constructs that include any combinations of these epitopes can be 
optimized using the procedures set forth herein, and spacers can be optimized as well. 

20 Multi-epitope constructs can be generated using methodology well known in the art. 

For example, polypeptides comprising the multi-epitope constructs can be synthesized and 
linked. Typically, multi-epitope constructs are constructed using recombinant DNA 
technology. 

25 IV. Expression Vectors and Construction of a Multi-Epitope Constructs 

The multi-epitope constructs of the invention are typically provided as an 
expression vector comprising a nucleic acid encoding the multi-epitope polypeptide. 
Construction of such expression vectors is described, for example in PCT/US99/10646. 
The expression vectors contain at least one promoter element that is capable of expressing a 
30 transcription unit encoding the nucleic acid in the appropriate cells of an organism so that 
the antigen is expressed and targeted to the appropriate HLA molecule. For example, for 
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administration to a human, a promoter element that functions in a human cell is 
incorporated into the expression vector. 

In preferred embodiments, the invention utilizes routine techniques in the field of 
recombinant genetics. Basic texts disclosing the general methods of use in this invention 
5 include Sambrook et aL, Molecular Cloning, A Laboratory Manual (2nd ed. 1989); 
Kriegler, Gene Transfer and Expression: A Laboratory Manual (1990); and Current 
Protocols in Molecular Biology (Ausubel et aL, eds., 1994); Oligonucleotide Synthesis: A 
Practical Approach (Gait, ed. 5 1984); Ruijpers, Nucleic Acids Research 18(17):5197 
(1994); Dueholm, J. Org. Chem. 59:5767-5773 (1994); Methods in Molecular Biology, 
10 volume 20 (Agrawal, ed.); and Tijssen, Laboratory Techniques in Biochemistry and 
Molecular Biology-Hybridization with Nucleic Acid Probes, e.g., Part I, chapter 2 
"Overview of principles of hybridization and the strategy of nucleic acid probe assays" 
(1993)). 

The nucleic acids encoding the epitopes are assembled in a construct according to 

15 standard techniques. In general, the nucleic acid sequences encoding multi-epitope 

polypeptides are isolated using amplification techniques with oligonucleotide primers, or 
are chemically synthesized. Recombinant cloning techniques can also be used when 
appropriate. Oligonucleotide sequences are selected which either amplify (when using 
PCR to assemble the construct) or encode (when using synthetic oligonucleotides to 

20 assemble the construct) the desired epitopes. 

Amplification techniques using primers are typically used to amplify and isolate 
sequences encoding the epitopes of choice from DNA or RNA {see U.S. Patents 4,683,195 
and 4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et aL, eds, 
1990)). Methods such as polymerase chain reaction (PCR) and ligase chain reaction (LCR) 

25 can be used to amplify epitope nucleic acid sequences directly from mRNA, from cDNA, 
from genomic libraries or cDNA libraries. Restriction endonuclease sites can be 
incorporated into the primers. Multi-epitope constructs amplified by the PCR reaction can 
be purified from agarose gels and cloned into an appropriate vector. 

Synthetic oligonucleotides can also be used to construct multi-epitope constructs. 

30 This method is performed using a series of overlapping oligonucleotides, representing both 
the sense and non-sense strands of the gene. These DNA fragments are then annealed, 
ligated and cloned. Oligonucleotides that are not commercially available can be chemically 
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synthesized according to the solid phase phosphoramidite triester method first described by 
Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an automated 
synthesizer, as described in Van Devanter et ai, Nucleic Acids Res. 12:6159-6168 (1984). 
Purification of oligonucleotides is by either native acrylamide gel electrophoresis or by 
5 . anion-exchange HPLC as described in Pearson & Reanier, J. Chrom. 255:137-149 (1983). 
The epitopes of the multi-epitope constructs are typically subcloned into an 
expression vector that contains a strong promoter to direct transcription, as well as other 
regulatory sequences such as enhancers and polyadenylation sites. Suitable promoters are 
well known in the art and described, e.g., in Sambrook et al and Ausubel et aL Eukaryotic 
10 expression systems for mammalian cells are well known in the art and are commercially 
available. Such promoter elements include, for example, cytomegalovirus (CMV), Rous 
sarcoma virus LTR and SV40. 

The expression vector typically contains a transcription unit or expression cassette 
that contains all the additional elements required for the expression of the multi-epitope 
1 5 construct in host cells. A typical expression cassette thus contains a promoter operably 
linked to the multi-epitope construct and signals required for efficient polyadenylation of 
the transcript. Additional elements of the cassette may include enhancers and introns with 
functional splice donor and acceptor sites. 

In addition to a promoter sequence, the expression cassette can also contain a 
20 transcription termination region downstream of the structural gene to provide for efficient 
termination. The termination region may be obtained from the same gene as the promoter 
sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information into the 
cell is not particularly critical. Any of the conventional vectors used for expression in 
25 eukaryotic cells may be used. Expression vectors containing regulatory elements from 
eukaryotic viruses are typically used in eukaryotic expression vectors, e.g., SV40 vectors, 
CMV vectors, papilloma virus vectors, and vectors derived from Epstein Bar virus. 

The multi-epitope constructs of the invention can be expressed from a variety of 
vectors including plasmid vectors as well as viral or bacterial vectors. Examples of viral 
30 expression vectors include attenuated viral hosts, such as vaccinia or fowlpox. As an 

example of this approach, vaccinia virus is used as a vector to express nucleotide sequences 
that encode the peptides of the invention. Upon introduction into a host bearing a tumor, 
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the recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits a 
host CTL and/or HTL response. Vaccinia vectors and methods useful in immunization 
protocols are described in, e.g., U.S. Patent No. 4,722,848. 

A wide variety of other vectors useful for therapeutic administration or 
5 immunization, e.g. adeno and adeno-associated virus vectors, retroviral vectors, non-viral 
vectors such as BCG (Bacille Calmette Guerin), Salmonella typhi vectors, detoxified 
anthrax toxin vectors, and the like, will be apparent to those skilled in the art. 

hnmunogenicity and antigenicity of the multi-epitope constructs are evaluated as 
described herein. 

10 

Targeting Sequences 

The expression vectors of the invention may encode one or more MHC epitopes 
operably linked to a MHC targeting sequence, and are referred to herein as "targeting 
nucleic acids" or "targeting sequences." The use of a MHC targeting sequence enhances 

15 the immune response to an antigen, relative to delivery of antigen alone, by directing the 
peptide epitope to the site of MHC molecule assembly and transport to the cell surface, 
thereby providing an increased number of MHC molecule-peptide epitope complexes 
available for binding to and activation of T cells. 

. MHC Class I targeting sequences can be used in the present invention, e.g., those 

20 sequences that target an MHC Class I epitope peptide to a cytosolic pathway or to the 

endoplasmic reticulum (see, e.g., Rammensee et al, Immunogenetics 41:178-228 (1995)).' 
For example, the cytosolic pathway processes endogenous antigens that are expressed 
inside the cell. Although not wishing to be bound by any particular theory, cytosolic 
proteins are thought to be at least partially degraded by an endopeptidase activity of a 

25 proteasome and then transported to the endoplasmic reticulum by the TAP molecule 

(transporter associated with processing). In the endoplasmic reticulum, the antigen binds to 
MHC Class I molecules. Endoplasmic reticulum signal sequences bypass the cytosolic 
processing pathway and directly target endogenous antigens to the endoplasmic reticulum, 
where proteolytic degradation into peptide fragments occurs. Such MHC Class I targeting 

30 sequences are well known in the art, and include, e.g., signal sequences such as those from 
Ig kappa, tissue plasminogen activator or insulin. A preferred signal peptide is the human 
Ig kappa chain sequence. Endoplasmic reticulum signal sequences can also be used to 
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target MHC Class II epitopes to the endoplasmic reticulum, the site of MHC Class I 
molecule assembly. MHC Class II targeting sequences can also be used in the invention, 
e.g., those that target a peptide to the endocytic pathway. These targeting sequences 
typically direct extracellular antigens to enter the endocytic pathway, which results in the 
5 antigen being transferred to the lysosomal compartment where the antigen is proteolytically 
cleaved into antigen peptides for binding to MHC Class II molecules. As with the normal 
processing of exogenous antigen, a sequence that directs a MHC Class II epitope to the 
endosomes of the endocytic pathway and/or subsequently to lysosomes, where the MHC 
Class II epitope can bind to a MHC Class II molecule, is a MHC Class II targeting 

10 sequence. For example, group of MHC Class II targeting sequences useful in the invention 
are lysosomal targeting sequences, which localize polypeptides to lysosomes. Since MHC 
Class II molecules typically bind to antigen peptides derived from proteolytic processing of 
endocytosed antigens in lysosomes, a lysosomal targeting sequence can function as a MHC 
Class II targeting sequence. Lysosomal targeting sequences are well known in the art and 

15 include sequences found in the lysosomal proteins LAMP-1 and LAMP-2 as described by 
August et ah (U.S. Patent No. 5,633,234, issued May 27, 1997), which is incorporated 
herein by reference. 

Other lysosomal proteins that'contain lysosomal targeting sequences include HLA- 
DM. HLA-DM is an endosomal/lysosomal protein that functions in facilitating binding of 
20 antigen peptides to MHC Class II molecules. Since it is located in the lysosome, HLA-DM 
has a lysosomal targeting sequence that can function as a MHC Class II molecule targeting 
sequence (Copier et al, 1 Immunol 157:1017-1027 (1996), which is incorporated herein 
by reference). 

The resident lysosomal protein HLA-DO can also function as a lysosomal targeting 
25 sequence. In contrast to the above described resident lysosomal proteins LAMP-1 and « 

HLA-DM, which encode specific Tyr-containing motifs that target proteins to lysosomes; 

HLA-DO is targeted to lysosomes by association with HLA-DM (Liljedahl et al. t EMBO J. 

15:481 7-4824 (1 996)), which is incorporated herein by reference. Therefore, the sequences 

of HLA-DO that cause association with HLA-DM and, consequently, translocation of 
30 HLA-DO to lysosomes can be used as MHC Class II targeting sequences. Similarly, the • 

murine homolog of HLA-DO, H2-DO, can be used to derive a MHC Class II targeting 
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sequence. A MHC Class II epitope can be fused to HLA-DO or H2-DO and targeted to 
lysosomes. 

In another example, the cytoplasmic domains of B cell receptor subunits Ig-a and 
Ig-jS mediate antigen internalization and increase the efficiency of antigen presentation as 

• 5 described in, for example, Bonnerot et ah, Immunity 3:335-347 (1995). Therefore, the 

cytoplasmic domains of the Ig-a and Ig-jS proteins can function as MHC Class II targeting 
sequences that target a MHC Class II epitope to the endocytic pathway for processing and 
binding to MHC Class II molecules. 

Another example of a MHC Class II targeting sequence that directs MHC Class II 

10 epitopes to the endocytic pathway is a sequence that directs polypeptides to be secreted, 
where the polypeptide can enter the endosomal pathway. These MHC Class II targeting 
sequences that direct polypeptides to be secreted mimic the normal pathway by which 
exogenous, extracellular antigens are processed into peptides that bind to MHC Class II 
molecules. Any signal sequence that functions to direct a polypeptide through the 

1 5 endoplasmic reticulum and ultimately to be secreted can function as a MHC Class II 

targeting sequence so long as the secreted polypeptide can enter the endosomal/lysosomal 
pathway and be cleaved into peptides that can bind to MHC Class II molecules. 

In another example, the Ii protein binds to MHC Class IMHC Class II molecules in 
the endoplasmic reticulum, where it functions to prevent peptides present in the 

20 endoplasmic reticulum from binding to the MHC Class II molecules. Therefore, fusion of a 
MHC Class II epitope to the Ii protein targets the MHC Class II epitope to the endoplasmic 
reticulum and a MHC Class II molecule. For example, the CLIP sequence of the Ii protein 
can be removed and replaced with a MHC Class IMHC Class II epitope sequence so that 
the MHC Class II epitope is directed to the endoplasmic reticulum, where the epitope binds 

25 to a MHC Class II molecule. 

In some cases, antigens themselves can serve as MHC Class II or I targeting 
sequences and can be fused to a universal MHC Class II epitope to stimulate an immune 
response. Although cytoplasmic viral antigens are generally processed and presented as 
complexes with MHC Class I molecules, long-lived cytoplasmic proteins such as the 

30 influenza matrix protein can enter the MHC Class IMHC Class II molecule processing 
pathway as described in, for example, Gueguen & Long, Proc, Nath Acad. Set USA 
93:14692-14697 (1996). Therefore, long-lived cytoplasmic proteins can function as a 
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MHC Class IMHC Class II targeting sequence. For example, an expression vector 
encoding influenza matrix protein fused to a universal MHC Class IMHC Class II epitope 
can be advantageously used to target influenza antigen and the universal MHC Class IMHC 
Class II epitope to the MHC Class IMHC Class II pathway for stimulating an immune 
5 response to influenza. 

Other examples of antigens functioning as MHC Class IMHC Class II targeting 
sequences include polypeptides that spontaneously form particles. The polypeptides are 
secreted from the cell that produces them and spontaneously form particles, which are 
taken up into an antigen-presenting cell by endocytosis such as receptor-mediated 

10 endocytosis or are engulfed by phagocytosis. The particles are proteolytically cleaved into 
antigen peptides after entering the endosomal/lysosomal pathway. 

One such polypeptide that spontaneously forms particles is HBV surface antigen 
(HBV-S) as described in, for example, Diminsky et al, Vaccine 15:637-647 (1997) or Le 
Borgne et al f Virology 240:304-315 (1998). Another polypeptide that spontaneously forms 

15 particles is HBV core antigen as described in, for example, Kuhrober et al, International 
Immunol. 9: 1203-1212 (1997). Still another polypeptide that spontaneously forms particles 
is the yeast Ty protein as described in, for example, Weber et aL, Vaccine 13:831-834 
(1995). For example, an expression vector containing HBV-S antigen fused to a universal 
MHC Class IMHC Class II epitope can be advantageously used to target HBV-S antigen 

20 and the universal MHC Class IMHC Class II epitope to the MHC Class IMHC Class II 
pathway for stimulating an immune response to HBV. 

Administration In Vivo 

The invention also provides methods for stimulating an immune response by 
25 administering an expression vector of the invention to an individual. Administration of an 
expression vector of the invention for stimulating an immune response is advantageous 
because the expression vectors of the invention target MHC epitopes to MHC molecules, 
thus increasing the number of CTL and HTL activated by the antigens encoded by the 
expression vector. 

30 Initially, the expression vectors of the invention are screened in mouse to determine 

the expression vectors having optimal activity in stimulating a desired immune response. 
Initial studies are therefore carried out, where possible, with mouse genes of the MHC 
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targeting sequences. Methods of determining the activity of the expression vectors of the 
invention are well known in the art and include, for example, the uptake of 3 H-thymidine to 
measure T cell activation and the release of 51 Cr to measure CTL activity as described 
below in Examples II and III. Experiments similar to those described in Example IV are 
5 performed to determine the expression vectors having activity at stimulating an immune 
response. The expression vectors having activity are further tested in human. To 
circumvent potential adverse immunological responses to encoded mouse sequences, the 
expression vectors having activity are modified so that the MHC Class MHC Class H 
targeting sequences are derived from human genes. For example, substitution of the 
10 analogous regions of the human homologs of genes containing various MHC Class MHC 
Class II targeting sequences are substituted into the expression vectors of the invention. 
Expression vectors containing human MHC Class IMHC Class II targeting sequences, such 
as those described in Example I below, are tested for activity at stimulating an immune 
response in human. 

1 5 The invention also relates to pharmaceutical compositions comprising a 

pharmaceutically acceptable carrier and an expression vector of the invention. 
Pharmaceutically acceptable carriers are well known in the art and include aqueous or non- 
aqueous solutions, suspensions and emulsions, including physiologically buffered saline, 
alcohol/aqueous solutions or other solvents or vehicles such as glycols, glycerol, oils such 

20 as olive oil or injectable organic esters. 

A pharmaceutically acceptable carrier can contain physiologically acceptable 
compounds that act, for example, to stabilize the expression vector or increase the 
absorption of the expression vector. Such physiologically acceptable compounds include, 
for example, carbohydrates, such as glucose, sucrose or dextrans, antioxidants such as 

25 ascorbic acid or glutathione, chelating agents, low molecular weight polypeptides, 

antimicrobial agents, inert gases or other stabilizers or excipients. Expression vectors can 
additionally be complexed with other components such as peptides, polypeptides and 
carbohydrates. Expression vectors can also be complexed to particles or beads that can be 
administered to an individual, for example, using a vaccine gun. One skilled in the art 

30 would know that the choice of a pharmaceutically acceptable carrier, including a 

physiologically acceptable compound, depends, for example, on the route of administration 
of the expression vector. 
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The invention further relates to methods of administering a pharmaceutical 
composition comprising an expression vector of the invention to stimulate an immune 
response. The expression vectors are administered by methods well known in the art as 
described in, for example, Donnelly et al {Ann. Rev. Immunol 15:617-648 (1997)); 
5 Feigner et al (U.S. Patent No. 5,580,859, issued December 3, 1996); Feigner (U.S. Patent 
No. 5,703,055, issued December 30, 1997); and Carson et al (U.S. Patent No. 5,679,647, 
. issued October 21, 1997). In one embodiment, the multi-epitope construct is administered 
as naked nucleic acid. 

A pharmaceutical composition comprising an expression vector of the invention can 

10 be administered to stimulate an immune response in a subject by various routes including, 
for example, orally, intravaginally, rectally, or parenterally, such as intravenously, 
intramuscularly, subcutaneously, intraorbitally, intracapsularly, intraperitoneally, 
intracistemally or by passive or facilitated absorption through the skin using, for example, a 
skin patch or transdermal iontophoresis, respectively. Furthermore, the composition can be 

1 5 administered by injection, intubation or topically, the latter of which can be passive, for 
example, by direct application of an ointment or powder, or active, for example, using a 
nasal spray or inhalant. An expression vector also can be administered as a topical spray, 
in which case one component of the composition is an appropriate propellant. The 
pharmaceutical composition also can be incorporated, if desired, into liposomes, 

20 microspheres or other polymer matrices as described in, for example, Feigner et al., U.S. 
Patent No. 5,703,055; Gregoriadis, Liposome Technology, Vols. I to HI (2nd ed. 1993). 
Liposomes, for example, which consist of phospholipids or other lipids, are nontoxic, 
physiologically acceptable and metabolizable carriers that are relatively simple to make and 
administer. 

25 The expression vectors of the invention can be delivered to the interstitial spaces of 

tissues of an animal body as described in, for example, Feigner et ah, U.S. Patent Nos. 
5,580,859 and 5,703,055. Administration of expression vectors of the invention to muscle 
is a particularly effective method of administration, including intradermal and 
subcutaneous injections and transdermal administration. Transdermal administration, such 

30 as by iontophoresis, is also an effective method to deliver expression vectors of the 

invention to muscle. Epidermal administration of expression vectors of the invention can 
also be employed. Epidermal administration involves mechanically or chemically irritating 

69 



WO 02/083714 



PCT7US02/09877 



the outermost layer of epidermis to stimulate an immune response to the irritant (Carson et 
aL, U.S. Patent No. 5,679,647). 

Other effective methods of administering an expression vector of the invention to 
stimulate an immune response include mucosal administration as described in, for example, 
5 Carson et ai 9 U.S. Patent No. 5,679,647. For mucosal administration, the most effective 
method of administration includes intranasal administration of an appropriate aerosol 
containing the expression vector and a pharmaceutical composition. Suppositories and 
topical preparations are also effective for delivery of expression vectors to mucosal tissues , 
of genital, vaginal and ocular sites. Additionally, expression vectors can be complexed to ' 

1 0 particles and administered by a vaccine gun. 

The dosage to be administered is dependent on the method of administration and 
will generally be between about 0.1 /ig up to about 200 fig. For example, the dosage can be 
from about 0.05 figfkg to about 50 mg/kg, in particular about 0.005-5 mg/kg. An effective 
dose can be determined, for example, by measuring the immune response after 

1 5 administration of an expression vector. For example, the production of antibodies specific 
for the MHC Class II epitopes or MHC Class I epitopes encoded by the expression vector 
can be measured by methods well known in the art, including ELIS A or other 
immunological assays. In addition, the activation of T helper cells or a CTL response can 
be measured by methods well known in the art including, for example, the uptake of 3 H- 

20 thymidine to measure T cell activation and the release of 51 Cr to measure CTL activity (see 
Examples II and HI below). 

The pharmaceutical compositions comprising an expression vector of the invention 
can be administered to mammals, particularly humans, for prophylactic or therapeutic 
purposes. Examples of diseases that can be treated or prevented using the expression 

25 vectors of the invention include infection with HB V, HCV, HIV and CMV as well as 

prostate cancer, renal carcinoma, cervical carcinoma, lymphoma, condyloma acuminatum 
and acquired immunodeficiency syndrome (ADDS). 

In therapeutic applications, the expression vectors of the invention are administered 
to an individual already suffering from cancer, autoimmune disease or infected with a 

30 virus. Those in the incubation phase or acute phase of the disease can be treated with 

expression vectors of the invention, including those expressing all universal MHC Class II 
epitopes, separately or in conjunction with other treatments, as appropriate. 
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In therapeutic and prophylactic applications, pharmaceutical compositions 
comprising expression vectors of the invention are administered to a patient in an amount 
sufficient to elicit an effective immune response to an antigen and to ameliorate the signs or 
symptoms of a disease. The amount of expression vector to administer that is sufficient to 
5 ameliorate the signs or symptoms of a disease is termed a therapeutically effective dose. 
The amount of expression vector sufficient to achieve a therapeutically effective dose will 
depend on the pharmaceutical composition comprising an expression vector of the 
invention, the maimer of administration, the state and severity of the disease being treated^ 
the weight and general state of health of the patient and the judgment of the prescribing 
10 physician. 

Examples 

The following examples are offered to illustrate, but not to limit the claimed 
invention. It is understood that the examples and embodiments described herein are for 

15 illustrative purposes only and that various modifications or changes in light thereof are 
suggested to persons skilled in the art and are to be included within the spirit and purview 
of this application and scope of the appended claims. 

Examples 1-9 provide examples of assays for evaluating the immunogenicity and 
antigenicity of multi-epitope constructs. 

20 Example 1: 

Antigenicity Assays 

High-affinity peptide-specific CTL lines can be generated from splenocytes of 
transgenic mice that have been primed with DNA, peptide/IFA, or lipopeptide. Briefly, 
splenocytes from transgenic mice are stimulated 0. 1 \xg/m\ peptide and LPS blasts. Ten 

25 days after the initial stimulation, and weekly thereafter, cells are restimulated with LPS 
blasts pulsed for 1 hour with 0.1 |ig/ml peptide. CTL lines are assayed 5 days following 
restimulation in an in situ IFNy ELISA as described above. Alternatively, CTL lines that 
are derived from, e.g., patients infected with the targeted pathogen or who have the targeted 
disease, e.g., cancer, can be used. Specific CTL lines that are not available either from 

30 transgenic mice or from patients are generated from PBMC of normal donors, drawing on 
the expertise in the art. 
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Target cells to be used in these assays are Jurkat or .221 cells transfected with 
A2.1/K b , Al 1/K b , Al/K b , or B7/K b for CTL hnes derived from transgenic mice. All these 
cell lines are currently available to us (Epimmune Inc., San Diego, CA). In the case of 
human CTL lines, .221 cells transfected with the appropriate human HLA allele are 
utilized. We currently have .221 cells transfected with A2 and Al, and are generating All, 
A24 and B7 transfectants. In an alternative embodiment, if unforeseen problems arise in 
respect to target cells, LPS blasts and EBV-transformed lines are utilized for murine and 
human CTL lines, respectively. 

To assay for antigenicity, serially diluted CTLs are incubated with 10 5 target cells 
and multiple peptide concentrations ranging from 1 to 10" 6 |ig/ml. In addition, CTLs are 
also incubated with target cells transfected with an episomal vector containing a multi- 
epitope construct of interest. Episomal vectors are known in the art. 

The relative amount of peptide generated by natural processing within the multi- 
epitope nucleic acid-transfected APCs is quantitated as follows. The amount of IFNy 
generated by the CTL lines upon recognition of the transfected target cells are recorded. 
The amount of synthetic peptide necessary to yield the same amount of IFNy are 
interpolated from a standard curve generated when the same CTL line is incubated in 
parallel with known concentrations of peptide. 

20 Example 2: 

Mice, Immunizations and Cell Cultures 

The derivation of the HLA-A2.1/K b (Vitiello et al., J Exp Med, Vol. 173(4):1007-15 
(1991)) and All/K b (Alexander etzl., J Immunol, Vol. 159(10):4753-61 (1997)) transgenic 
mice used in this study has been described. HLA B7 K b transgenic mice are available in 

25 house (Epimmune Inc., San Diego, CA). HLA DR2, DR3 and DR4 transgenic mice are 
obtained from C. David (Mayo Clinic). Non-transgenic H-2 b mice are purchased from 
Charles River Laboratories or other commercial vendors. Immunizations are performed as 
described in (Ishioka et al., J Immunol, Vol. 162(7):3915-25 (1999)). All cells are grown in 
culture medium consisting of RPMI 1640 medium with HEPES (Gibco Life Technologies) 

30 supplemented with 10% FBS, 4 mM L-glutamine, 50 |*M 2-ME, 0.5 mM sodium pyruvate, 
100 |ig/ml streptomycin and 100 U/ml penicillin. 
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HLA transgenic mice and antigenicity assays are used for the purpose of testing and 
optimization CTL responses. The natural crossreactivity between HLA-DR and IA b can 
also be exploited to test HTL responses. This evaluation provides an assessment of the 
antigenicity and immunogenicity of multi-epitope constructs. • . . 

5 

Example 3: 
Proliferation Assays 

To assess the ability of HTL epitopes to induce an immune response, assays such as 
proliferation assays are often performed. For example, mouse CD4 T lymphocytes are 

10 immunomagnetically isolated from splenic single cell suspensions using DynaBeads Mouse 
CD4 (L3T4) (Dynal). Briefly, 2 x 10 7 spleen cells are incubated with 5.6 x 10 7 magnetic 
beads for 40 minutes at 4° C, and then washed 3 times. Magnetic beads are detached using 
DetachaBead Mouse CD4 (Dynal). Isolated CD4 T lymphocytes (2 x 10 s cells/well) are 
cultured with 5 x 10 5 irradiated (3500 rad) syngeneic spleen cells in triplicate in flat-bottom 

1 5 96-well microtiter plates. Purified peptides are added to wells at a final concentration of 
20, 1, 0.05 and 0|x g/ml and cells are cultured for a total of 4 days. Approximately 14 hour 
before harvesting, 1 jaCi of 3 H-thymidine (ICN) is added to each well. The wells are 
harvested onto Unifilter GF/B plates (Packard) using the Filtermate Harvester (Packard). 
H-Thymidine incorporation is determined by liquid scintillation counting using the 

20 TopCount™ microplate scintillation counter (Packard). 



Example 4: 

5 Chromium Release Assay 

This assay to measure CTL activity is well known in the art. The assay quantifies 

25 the lytic activity of the T cell population by measuring the percent 51 Cr released from a 
51 Cr-labeled target population (Brunner et al., Immunology, Vol. 14(2):181-96 (1968)). 
Data derived from the chromium release assay is usually expressed either as a CTL 
frequency/10 6 cell (limiting dilution analysis, LDA; {Current Protocols in Immunology, 
Vol 1, John Wiley & Sons, Inc., USA 1991 Chapter 3; Manual of Clinical Laboratory 

30 Immunology, Fifth edition, ASM Press, 1997 Section R), or by a less cumbersome 

quantitative assessment of bulk CTL activity (lytic Units; LU assay). In a LU assay, the 
standard E:T ratio versus percent cytotoxicity data curves generated in a 51 Cr-release assay 
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are converted into lytic units (LU) per 10 effector cells, with 1 LU defined as the lytic 
activity required to achieve 30% lysis of target cells (Wunderlick, J., Shearer, G., and 
Livingston, A. In: J. Coligan, A. Kruisbeek, D. Margulies, E. Shevach, and W. Strober 
(Eds.), Current Protocols in Immunology, Vol 1, "Assays for T cell function: induction and 
5 measurement of cytotoxic T lymphocyte activity." John Wiley & Sons, Inc., USA, p. 
3.11.18). The LU calculation allows quantifying responses and thus readily comparing 
different experimental values. 

. Example 5: 

10 In situ IFNy ELISA 

An in situ DFNy ELISA assay has been developed and optimized for both freshly 
isolated and peptide-restimulated splenocytes {see, e.g., McKinney et ah, J. Immunol. 
Meth 237 (1-2):105-1 17 (2000))IFN. This assay is based on the ELISPOT assay, but 
utilizes a soluble chromagen, making it readily adaptable to high-throughput analysis. In 

15 both the primary and restimulation assays, this technique is more sensitive than either a 
traditional supernatant ELISA or the 51 Cr-release assay, in that responses are observed in 
the in situ ELISA that are not detectable in these other assays. On a per cell basis, the 
sensitivity of the in situ ELISA is approximately one IFNy secreting cell/10 4 plated cells. 
96-well ELISA plates are coated with anti-IFNy (rat anti-mouse IFNoc MAb, Clone 

20 R4-6A2, Pharmingen) overnight at 4°C, and then blocked for 2 hours at room temperature 
with 10% FBS in PBS. Serially diluted primary splenocytes or CTLs are cultured for 20 
hours with peptide and 10 5 Jurkat A2.1/K b cells/well at 37°C with 5% C0 2 . The following 
day, the cells are washed out and the amount of DFNy that had been secreted into the wells 
is detected in a sandwich ELISA, using biotinylated a-IFNy (rat anti-mouse IFNy mAb, 

25 Clone XMG1.2, Pharmingen) to detect the secreted IFNy. HRP-coupled strepavidin 

(Zymed) and 1MB (ImmunoPure® TMB Substrate Kit, Pierce) are used according to the 
manufacturer's directions for color development. The absorbance is read at 450 nm on a 
Labsystems Multiskan RC ELISA plate reader. In situ IFNy ELISA data is evaluated in 
secretory units (SU), based on the number of cells that secrete 100 pg of IFNy in response 

30 to a particular peptide, corrected for the background amount of IFN in the absence of 
peptide. 
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Example 6: 
ELISPOT Assay 

The ELISPOT assay quantifies the frequency of T cells specific for a given peptide 
5 by measuring the capacity of individual cells to be induced to produce and release specific 
lymphokines, usually IFNy. The increased sensitivity of the ELISPOT assay has allowed 
investigators to detect responses from cells freshly isolated from infected humans or 
experimental animals (Murali-Krishna et aL, Immunity, Vol. 8(2):177-87 (1998); Ogg et ai., 
Science, Vol. 279(5359):2103-6 (1998)). The ELISPOT assays are conducted as described 

1 0 above for the IFNy ELISA until the final steps, where ExtrAvidin-AP (Sigma, 1 :500 

dilution) is used in place HRP-strepavidin. Color is developed using the substrate 5-BCDP 
(BioRad) according to the manufacturer's directions. Spots are counted using a phase 
contrast microscope. Alternatively, spots are counted utilizing the Zeiss KS ELISPOT 
reader. In this case the BCIP/NBT substrate is used. 

1 5 The ELISPOT assay is routinely utilized to quantitate immune responses. The spots 

can be manually counted, however, in a preferred mode, a KS ELISPOT reader from Zeiss, 
a microscope-based system with software specifically designed to recognize and count 
spots is used. 

20 Example 7: 

Tetramer Staining 

Tetramer staining is a flow cytometric technique that detects epitope-specific 
human CD8 + T-lymphocytes based on the interaction between the peptide epitope, class I 
antigen and the T-cell receptor specific for the epitope. This assay allows for the rapid 

25 quantitation of epitope specific human CD8 + T-lymphocytes in freshly isolated blood 

samples. MHC tetramers for various HIV peptide/HLA combinations, obtained, e.g., from 
the NIH repository (Tetramer Core Facility: http://www.miaid.nih.gov/reposit/ 
tetramer/index.html). To label epitope-specific cells, 1 xlO 6 PBMC in a 100 \l\ volume are 
incubated in the dark for 40 minutes with 5 fig/ml of the appropriate tetramer plus 

30 monoclonal antibodies that recognize human CD3 and CD8 (available in different 

fluorochrome-conjugated forms from commercial sources including PharMingen, San 
Diego, CA or BioSource, Camarillo, CA). The cells are washed and paraformaldehyde 
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fixed prior to analysis using a FACsan or FACSCalibur flow cytometer (Becton Dickinson 
Immunocytometry Systems, San Jose, CA). Sample data are analyzed using CellQuest 
software. 

5 Example 8: 

Assays from Clinical Samples 

Various assays to evaluate the specific CD8 + CTL activity in frozen PBMC samples 
from patients or volunteers can be used. ELISPOT, chromium release, in situ IFNy release, 
proliferation and tetramer assays are all useful to evaluate responses from various 
1 0 experimental models, e.g. , those of murine and/or primate origin. 

Experimental methods for the murine version of these assays are described above, 
and these are adapted to human systems as described (Livingston et al, J Immunol, Vol. 
159(3):1383-92 (1997); Heathcote et al, Hepatology, Vol. 30(2):531-6 (1999); Livingston 
et al., J Immunol Vol. 162(5):3088-95 (1999)) and can be further adapted a recognized by 
15 one of ordinary skill in the art. Calculations on the amounts of frozen PBMC samples 
necessary to complete the assays are also described greater detail in Example 14. 

Example 9: 
TransRenic Animals 

20 Transgenic mice (HLA-A2. 1/K b H2 b ; HLA-A1 1/K b ; HLA-B7/K b ) are immunized 

intramuscularly in the anterior tibialis muscle or subcutaneously in the base of the tail with 
doses up to 100 jig of DNA or peptide in 10-100 ^1 volumes. DNA is formulated in saline, 
and peptides in IFA. 11-21 days later, the animals are sacrificed using CO2 asphyxiation, . 
their spleens removed and used as the source of cells for in vitro determination of CTL 

25 function. Typically, 3-6 mice per experimental group are used. In addition, spleens from 
non-immunized mice are used as a source of APC for restimulation of CTL cultures. Both 
males and females of 8- 1 2 weeks of age are used. 
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Example 10: 

Demonstration of Simultaneous Induction of Responses Against Multiple CTL and HTL 
Epitopes 

Construction and testing of CTL epitope strings: 
5 This example provides, an example of testing mutliple CTL and HTL epitopes. For 

example, epitope strings encompassing 10-12 different CTL epitopes under the control of a 
single promoter are synthesized and incorporated in a standard plasmid, pcDNA 3.1 
(Invitrogen, San Diego). These constructs include a standard signal sequence and a 
universal HTL epitope, PADRE®. Each set of epitopes is chosen to allow balanced 

10 population coverage. To facilitate testing and optimization, a balanced representation of 
epitopes that have been shown to be immunogenic in transgenic mice, and/or antigenic in 
humans are included. 

The specific order of these CTL epitopes is chosen to minimize Class I junctional 
motifs by the use of the computer program, as described herein. If, despite best efforts 

15 regarding order optimization, potential junctional epitopes are still present in a construct in 
accordance with the invention, corresponding peptides are synthesized to monitor for CTL 
responses against such epitopes in HLA transgenic mice. Generally, minimization of 
junctional motifs is successful and adequate. However, if responses against any junctional 
epitopes are detected, these junctional epitopes are disrupted by the use of short one to two 

20 residue spacers, such as K, AK, KA, KK, or A, compatible with expected proteolytic 
cleavage preferences discussed in the previous sections. 

Since the ultimate use of optimized constructs is a human vaccine, optimized 
human codons are utilized. However, to facilitate the optimization process in HLA 
transgenic mice, care are applied, to select, whenever possible, human codons that are also 

25 optimal for mice. Human and murine codon usage is very similar. See, for example, 
Codon usage database at http://www.kazusa.or.jp/codon/. 

Human cells transfected with the various multi-epitope nucleic acid vaccine 
constructs can be used in antigenicity assays, conducted in parallel with in vivo testing in 
HLA transgenic mice. Any potential discrepancy between multi-epitope nucleic acid 

30 vaccine efficacy, due to the differential codon usage, is addressed by the availability of 
these two different assay systems. 
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Typically, antigenicity and imnmnogenicity testing of plasmid constructs is 
conducted in parallel. In vivo testing in transgenic mice are performed for A2, Al 1, and B7 
HLA transgenic mice. Following a protocol well established in our laboratory, cardiotoxin 
pretreated mice are injected i.m. with 100 jig of each plasmid and responses evaluated 
5 eleven days later (Ishioka et al., J Immunol Vol. 162(7):3915-25 (1999)). Assays will 
include ELISPOT from freshly isolated cells, as well as interferon gamma release and 
cytotoxicity chromium release assays from restimulated cell cultures. All of the above 
. mentioned techniques are well established in the art. The simultaneous measurement of 
responses against epitopes is not problematic, as large colonies of transgenic mice are 

10 already established "in house" for these HLA types. Groups of four to six mice are 

adequate to measure responses against six to ten different epitopes, in multiple readout 
assays. Testing of HLA A2-restricted, HIV-derived epitopes in HLA A2 transgenic mice is 
typically employed. However, should problems be encountered, antigenicity testing using 
human APC can be used as an alternative strategy, or, can be used to complement the 

1 5 transgenic mice studies. 

For the purpose of extending the correlation between immunogenicity in transgenic 
animals and antigenicity, as noted in the studies reported herein, antigenicity testing is 
utilized to evaluate responses against epitopes such as Pol 498, Env 134, Nef 221, Gag 271, 
for which high affinity CTL lines are already available in house. For the purpose of 

20 generating additional suitable CTL lines, direct immunization of HLA transgenic mice with 
peptides emulsified in adjuvant, or lipopeptides are utilized, as described herein, and . 
routinely applied in our laboratory, to generate lines for use in antigenicity assays. 

Antigenicity assays are also used, as a primary readout for epitopes for which in 
vivo optimization experiments are not feasible. These epitopes include A24 and possibly 

25 Al restricted epitopes, as well as any epitope which is non-immunogenic in HLA 

transgenic mice. In any such cases, we use human CTL lines, generated from pathogen 
exposed individuals. Alternatively, we generate CTL lines for in vitro CTL induction, 
using GMCSF/DL4-induced dendritic cells and peripheral blood lymphocytes (Celis et al., 
Proc Natl Acad Set USA, Vol. 91(6):2105-9 (1994)). 

30 Episomal vectors encoding the multi-epitope constructs are generated and 

transfected into appropriate human cell lines to generate target cells. For example, the 
human T cell line Jurkat can be used, but lymphoblastoid cell lines have also been 
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successfully utilized. For experiments utilizing CTL lines of human origin, well- 
characterized HLA-matched, homozygous, EB V cell lines are commonly used as a source 
of purified-MHC and as target cells and are used as recipients of the multi-epitope nucleic 
acid transfections. For experiments utilizing CTL lines derived from HLA transgenic mice, 
5 a collection of Class I negative, EBV-transformed, mutant cell lines .221 (Shimizu Y, 
DeMars R, J Immunol, Vol. 142(9):3320-8 (1989)) transfected with matching HLA/K b 
chimeric constructs are used as the recipient of the multi-epitope nucleic acid transfections. 
Such cells effectively present peptide antigens to CTL lines (Celis et al., Proc Natl Acad 
Sci USA, Vol. 91(6):2105-9 (1994)). 

10 

Construction and testing of HTL epitope strings: 

Epitope strings encompassing 3-20 different HTL epitopes under the control of a 
single promoter are synthesized and incorporated into a standard plasmid, pcDNA 3.1 
(Invitrogen, San Diego). To facilitate testing and optimization, each set of epitopes for a 

15 given construct is chosen to provide a balanced representation of epitopes which are 

already known to be immunogenic in IA b mice. In addition, all the peptides corresponding 
to junctions are synthesized and tested for binding to IA b and, most importantly, for binding 
to a panel of fourteen different DR molecules, representative of the most common DR 
alleles worldwide (Southwood et al., J Immunol, Vol. 160(7):3363-73 (1998)). Thus, HTL 

20 epitopes that are not directed to an antigen of interest are not created within these plasmids. 
However, should junctional regions with good DR binding potential (and hence, potential 
DR restricted immunogenicity in vivo) be detected, a spacer such as GPGPG is introduced 
to eliminate them. In all constructs, the number of Class I junctional motifs will also be 
minimized, as described herein. 

25 Experimental vaccine plasmids are tested for immunogenicity using HLA DR 

transgenic mice and/or mice of the H2b haplotype. Proliferation and/or cytokine 
production are measured (IL5, IFNy). In a typical protocol, cardiotoxin treated mice are 
injected i.m. with 100 jig of each plasmid and responses evaluated eleven days later 
(Ishioka et al., J Immunol, Vol. 162(7):3915-25 (1999)). 

30 
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Testing for interactions between CTL and HTL epitopes 

The activities described above yield small, functional blocks of epitopes, which are 
utilized to demonstrate simultaneous responses / antigenicity against all epitopes 
analyzable. These blocks are the subject to further optimization, as described in the next 
5 example. Using these same constructs, immunodominance is assessed. Specifically, all the 
CTL epitope constructs are mixed together, or all the HTL epitope constructs are mixed 
together. The results obtained with the pool of constructs are then compared with the 
results obtained with the same construct, injected separately. 

These constructs are also used to determine the effects of HTL epitopes on 

10 responses to CTL epitopes. Specifically, HTL and CTL containing plasmids are pooled 
and injected in mice, and CTL and HTL responses to selected epitopes are measured as 
described herein. Often, it is determined whether the presence, e.g., of HTL epitopes 
derived from the target antigen enhances CTL responses beyond the level of response 
attained using a plasmid-containing a pan DR binding epitope, e.g. , PADRE or a 

15 PADRE® family molecule, in the CTL epitope constructs. Typically, it is also determined 
whether PADRE® inhibits or augments responses to target antigen-derived HTL epitopes or 
conversely, if HTL epitopes derived from the antigen of interest inhibit or augment 
responses to PADRE®. 

Responses to a large number of epitopes is attainable using this methodology. It is 

20 possible that the pooling of constructs may inhibit responses against some of the weaker 
epitopes. In this case, the pooling experiments are repeated after optimization. 

Example 11: 

Optimization of CTL and HTL Multi-epitone Constructs 

25 This example describes the optimization the CTL and HTL constructs described in 

Example 10. The potential influence of flanking residues on antigenicity and 
immunogenicity is also assessed in optimizing minigen constructs. These studies involve 
the inclusion of flanking residues, a synonym for which is "spacers," which have been 
designed to facilitate effective processing. 

30 Such an analysis can be performed as follows. First, the results of testing of the 

CTL multi-epitope constructs described in Example 10 are analyzed for trends and 
correlations between activity and the presence of specific residues at the 3 residues flanking 
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the epitope's N- and C-termini. Epitopes for which suboptimal CTL priming is noted, that 
are suboptimal with respect to magnitude of response, are the targets for flanking region 
optimization. For each of the CTL multi-epitope nucleic acid vaccines, encoding 10-12 
different CTL epitopes, 'second generation 5 multi-epitope nucleic acid vaccines, with 
5 optimized configuration, are produced. 

In one embodiment, the first optimization design is to introduce either an Alanine 
(A) or Lysine (K) residue at position C+l for all epitopes associated with suboptimal 
performance. A second optimization design is to introduce in the C+l position, the residue 
naturally occurring in the target antigen, HIV, that are associated with antigenicity. 

10 For selected epitopes, additional modifications are also introduced. Specifically, 

the effect of introducing other residue spacers at the epitope C- and N- termini are also 
investigated. Depending on the results of the analysis of the multi-epitope nucleic acid 
vaccines described in Example 10, residues investigated may further include, for example, 
G, Q, W, S and T. If junctional epitopes are created by these modifications, alternative 

15 epitope orders eliminating the junctional epitopes, are rationally designed, as described 

herein. All second generation constructs are tested for antigenicity and immunogenicity, as 
described herein. 

As a result of these modifications, variations in activity that correspond to specific 
modifications of the multi-epitope constructs are identified. Certain modifications are 
20 found that have general, beneficial effects. To confirm this, generation and testing of 
additional multi-epitope nucleic acid vaccines in which all epitopes (also the ones which 
displayed acceptable antigenicity and immunogenicity) are subject to the same 
modification are conducted. In some instances, increased activity is noted for some 
epitopes but not others, or less desirably that certain modifications increase the activity of 
25 some, but decrease the activity of other epitopes. Li such cases, additional multi-epitope 
nucleic acid vaccines are designed and tested, to retain the beneficial modifications, while 
excluding those alterations that proved to be detrimental or have no effect. 

These multi-epitope nucleic acid vaccines are designated so that: a) a minimum of 
predicted junctional epitopes are present; and b) the epitopes which were not functional in 
30 the previous multi-epitope nucleic acid vaccines are now in a more efficacious context. 

For HTL multi-epitope constructs, the data obtained from the "first generation" 
constructs are inspected for trends, in terms of junctional epitopes, and epitope position 
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within the constructs, and proximity to spacers, e.g. GPGPG spacers. If specific trends are 
detected, second generation constructs are designed based on these trends. Alternatively, in 
case of multi-epitope consructs yielding suboptimal activity, the potential effectiveness of 
other targeting strategies, such as the ones based on Ii and LAMP are reevaluated, and 
5 compared to no targeting and simple, leader sequence targeting. 

When large variations in activity of either the CTL or HTL multi-epitope constructs 
described in this section are detected, the results are consistent with influences such as 
conformational or "long-range" effects impacting construct activity. These variables can 
be analyzed by means of current molecular and cellular biology techniques. For example, 

10 cell lines transfected with the various multi-epitope constructs could be analyzed for 
mRNA expression levels, and stability by Northern analysis or primer extension assays 
(Current Protocols in Molecular Biology, Vol 3, John Wiley & Sons, Inc. USA 1999). 

In all multi-epitope nucleic acid vaccines, an antibody tag such as MYC/his can also 
be included. This tag allows for testing of protein expression levels. The inclusion of 

15 MYC/his tag (Manstein et al., Gene, Vol. 162(l):129-34 (1995)) also allows determination 
of the stability of the expressed products, by pulse-chase experiments. The results of these 
assays can then be compared with the results of the antigenicity and immunogenicity 
experiments. The results are inspected for the presence of trends and general rules, and 
correlation between the different variables examined. 

20 

Example 12: 

Determination of the Simplest Plasmid Configuration Capable of Effective Delivery of : 
Selected Epitopes 

The experiments described in Examples 1 1 and 12 are designed to address variables 
25 concerning multi-epitope nucleic acid vaccine design. Ideally, a vector that can be used in 
humans is used through the entire program, but one DNA vaccine plasmid for the vaccine 
epitope optimization studies can be used and then switched to a vector suitable for human 
use. Actual vector selection is dependent on several variables. For example, the 
availability of vectors, suitable for human use, through a reliable source, such as the 
30 National Gene Vector Laboratory (University of Michigan) is a factor. 

In this example, the optimized constructs are also ligated to form larger blocks of 
epitopes. All constructs are preferably designed to incorporate PADRE® peptides and 
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leader sequence targeting in the case of CTL multi-epitope constructs. Specifically, two 
pairs of the 10-12 CTL epitope constructs are ligated to generate two 20-24 CTL epitope 
constructs. In a situation where ligation of epitopes yields suboptimal (decreased) activity 
compared to the smaller constructs, alternative combinations and orders of ligation are 
5 investigated. The specific pair of 20-24 CTL epitope constructs yielding optimal activity 
are then ligated and the resulting construct encompassing all CTL epitopes evaluated for 
activity. Once again up to two alternative orientations are investigated. Because of the • 
relatively large size of this construct, the specific effect of targeting sequences are 
confirmed, since it is possible that leader sequence targeting are more effective on small 
10 size constructs, while larger size constructs may be most effectively targeted by ubiquitin 
signals. Specifically, one construct without any specific targeting sequences is generated 
and compared to a construct that is targeted for degradation by the addition of a ubiquitin 
molecule. 

A similar strategy is used for HTL. Two pairs of the 3-5 HTL epitope constructs 
15 are ligated to generate two 7-9 HTL epitope constructs. Once again, in a situation where 
ligation of these epitopes yields suboptimal (decreased) activity, alternative combinations 
and order of ligation are investigated. The specific pair of 7-9 CTL epitope constructs 
yielding optimal activity are ligated and the resulting construct, encompassing all HTL 
epitopes, is evaluated for activity. Once again, up to two alternative orientations are 
20 investigated. 

Based on these results an optimized plasmid configuration capable of effective 
delivery of a panel, e.g., of HIV epitopes, are selected for clinical trial evaluation. Of 
course, epitopes from any antigen of interest (infectious or disease-associated) can be used 
alone or in combination. This configuration will entail one or more HTL epitope constructs 

25 and one or more CTL epitope constructs. A combination of one long CTL and one long 
HTL epitope construct capable of effectively delivering all epitopes, is most preferable, as 
it simplifies further clinical development of the vaccine. In case undesirable interactions 
between the two constructs are observed when co-injected, injection of the different 
plasmids in the same animals, but in different injection sites, or at different points in time 

30 are examined. Alternatively, if a single CTL construct and HTL construct encoding all the 
desired epitopes is not identified, pools of constructs are considered for further 
development. 
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Example 13: 

Evaluation and Characterization of CD8 ± Lymphocyte Responses Induce Following 
Immunization With Multi-Epitope Vaccine 
5 CD8+ lymphocyte responses were measured mostly relying on the ELISPOT 

technique. The ELISPOT assay is known in the art, and is regularly used in our laboratory. 
An automated Zeiss ELISPOT reader is also used as set forth herein. The assays utilized to 
measure CD8+ responses are primarily the IFNy ELISPOT assay on freshly isolated cells 
as well as cells restimulated in vitro with peptide. In addition, in selected instances, 

10 chromium release assays are utilized. The results were correlated with the ones observed in 
the case of the ELISPOT assays. Tetramer staining on selected peptide/MHC combinations 
was also performed. 

The clinical assay was developed and validated. The timing of this activity 
coincides with the period of time that follows selection of a clinical vaccine EpiGene™, 

1 5 and precedes the availability of actual samples from individuals enrolled in the clinical 
trial. Assays for CTL evaluation can be established based on experience in the art, for 
example, experience in establishing assays for CTL evaluations in the Phase I and II trials 
of the experimental HBV vaccine, Theradigm (Livingston et al, J Immunol, Vol. 
159(3):1383-92 (1997); Heathcote et al., Hematology, Vol. 30(2):531-6 (1999); Livingston 

20 et al, J Immunol, Vol. 162(5):3088-95 (1999)). Specifically, Ficoll-purified PBMC 

derived from normal subjects, as well from, e.g., unvaccinated volunteers can be used. As 
noted previously, other antigenic target(s) can be used in accordance with the invention. 

Example 14: 

25 Design of Optimized Multi-Epitope DNA-based Vaccine Constructs 

Optimized constructs were designed with the aid of the computer-assisted methods 
described above which simultaneously minimize the formation of junctional epitopes and 
optimize C+l processing efficiency. The following motifs were utilized for junctional 
minimization: murine K b (XXXX(FY)X 2 . 3 (LIMV)); D b (XXXXNX 2 - 3 LIMV)); human A2 

30 (X(LM)X 6 - 7 V); human A3/A1 1 (X(LMV)X 6 . 7 (KRY)); and human B7 (XPX 6 . 7 (LIMVF)). 
The C+l propensity values were calculated from the data presented in Figure 6 and are as 
follows^. K = 2.2; N = 2; G = 1.8; T = 1.5; A,F,S = 1.33; W,Q = 1.2; R = 1.7; M,Y = 1; I = 
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0.86; L = 0.76; V,D,H,E,P = 0. Insertion of up to four amino acids was permitted. 
Examples of constructs designed by this procedure and other procedures set forth herein are 
depicted in Figure 18. A number of these constructs were characterized in vitro and in vivo 
immunogenicity studies, which are set forth hereafter. Figure 19 lists amino acid epitope 
5 sequences encoded by certain nucleic acid sequences in the multi-epitope constructs. 

Example 15: 

Immunogenicity Testing of Multi-epitope CTL Constructs and Influence of Flanking 
Amino Acids 

10 HLA transgenic mice were used for immunogenicity testing of different multi- 

epitope constructs. One group of mice were pretreated by injecting 50 ix\ of 10 ixM 
cardiotoxin bilaterally into the tibialis anterior muscle, and then four or five days later, 100 
\xg of a DNA construct diluted in PBS was administered to the same muscle. In another 
group, each mouse was injected with a peptide emulsified in CFA, wherein the peptide 

1 5 corresponds to an epitope within the DNA construct administered to mice in the DNA 
injection group. Eleven to fourteen days after immunization, splenocytes from DNA 
vaccinated animals and peptide vaccinated animals were recovered and CTL activity was 
measured in one of several assays, including a standard 5I Cr-release assay, an ELISPOT 
assay that measured y-IFN production by purified CD8+ T-lymphocytes without peptide 

20 epitope-specific restimulation, and an in situ ELISA, which included an in vitro epitope- 
specific restimulation step with a peptide epitope. Examples of CTL activity induced by 
the EP-HIV-1090 construct upon stimulation with peptide epitopes are shown in Figure 
14A, and CTL activity induced by the PfCTL.l, PfCTL.2, and PFCTL.3 constructs upon 
stimulation with peptide epitopes are shown in Figure 14B. 

25 The effect of different amino acids in the C+l flanking position was directly 

evaluated by inserting different amino acids at the C+l position relative to the Core 18 
epitope in the HBV.l construct. The immunogenicity data clearly demonstrate reduced 
immunogenicity of the Core 18 epitope when it was flanked at the C+l position by W, Y, 
or L (Figure 6b). In contrast, insertion of a single K residue dramatically increased the 

30 CTL response to Core 18. Enhancement of CTL responses was also observed using R, C, 
N, or G at the C+l position. These data clearly demonstrate that C+l processing 
optimization can improve multi-epitope construct design. 
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Example 16: 

Immunogenicitv Testing of Multi-epitope HTL Constructs and Influence of Spacer 
Sequences 

5 A universal spacer consisting, of GPGPG was developed to separate HTL epitopes, 

thus disrupting junctional epitopes. The logic behind the design of this spacer is that 
neither G nor P are used as primary anchors, positions 1 and 6 in the core region of an HTL 
peptide epitope, by any known murine or human MHC Class IMHC Class II molecule. 
The gap of five amino acids introduced by this spacer separates adjacent epitopes so the 

10 amino acids of two epitopes cannot physically serve as anchors in the 1 and 6 positions. 
The utility of the GPGPG spacer was tested using synthetic peptides composed of four 
HIV-1 epitopes, one having three spacers and the other lacking spacers, known to bind 
mouse IA b . HTV 75mer was the construct having three GPGPG spacers and HIV 60mer 
was the construct lacking the three spacers. Immunization of CB6F1 mice with the peptide 

15 in CFA induced HTL responses against 3 of 4 of the epitopes in the absence of the spacer 
but all epitopes were immunogenic when the spacer was present (Figure 15). This 
evidence demonstrates that spacers can improve the performance of multi-epitope 
constructs. 

The ability of multi-epitope HTL DNA-based constructs to induce an HTL response 
20 in vivo was evaluated by intramuscular immunization of H2 bxd mice with an EP-HIV- 1043- 
PADRE® construct. The EP-HIV- 1 043 -PADRE® construct is set forth in Figure 18, and 
the difference between EP-HIV-1043-PADRE® and EP-HIV- 1043 is that the former 
includes a C-terminal GPGPG spacer followed by the PADRE® sequence 
AKFVAAWTLKAAA. Eleven days after immunization, no booster immunizations were 
25 administered, CD4 T cells were purified from the spleen, and peptide specific HTL 

responses were measured in a primary y-IFN ELISPOT assay. Examples of HTL activity 
induced by constructs encoding HIV epitopes are shown in Figure 16. Overall, the HTL 
responses induced by DNA immunization with the multi-epitope HIV HTL construct were 
generally of equal or greater magnitude than the responses induced by peptide 
30 immunization. 
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Thus, as described above, the invention provides a novel method and system for 
automatically analyzing polypeptide junctions, eliminating or reducing the number of 
junctional epitopes, and identifying spacer combinations to optimize the efficacy of multi- 
epitope constructs. Those skilled in the art will know, or be able to ascertain using no more 
than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. These equivalents are intended to be encompassed by the 
following. claims. 
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WHAT IS CLAIMED IS: 

1 . A computerized method for designing a multi-epitope construct having 
multiple epitope nucleic acids, the method comprising the steps of: 

storing a plurality of input parameters in a memory of a computer system, said input 
5 parameters comprising a plurality of epitopes, at least one motif for identifying junctional 
epitopes, a plurality of amino acid insertions and at least one enhancement weight value for 
each insertion; 

generating a list of epitope pairs from said plurality of epitopes; 
determining for each of said epitope pairs at least one optimum combination of 
10 amino acid insertions based on said at least one motif, said plurality of insertions and said 
at least one enhancement weight value for each insertion; and 

identifying at least one optimum arrangement of said plurality of epitopes, wherein 
a respective one of said at least one optimum combination of amino acid insertions is 
inserted at a respective junction of two epitopes, so as to provide an optimized multi- 
15 epitope construct. 



20 



2. The method of claim 1 wherein said step of identifying at least one optimum 
arrangement comprises performing an exhaustive search wherein all permutations of 
arrangements of said plurality of epitopes are evaluated. 

3. The method of claim 1 wherein said step of identifying at least one optimum 
arrangtnent comprises performing a stochastic search wherein only a subset of all 
permutations of arrangements of said plurality of epitopes are evaluated. 



25 4. The method of claim 1 wherein said step of identifying at least one optimum 

arrangement comprises: 

performing an exhaustive search of all permutations of arrangements of said 

plurality of epitopes when the number of epitopes to be included in said multi-epitope 

construct is less than a specified value X; and 
30 performing a stochastic search, wherein only a subset of all peraiutations of 

arrangements of said plurality of epitopes are evaluated, when the number of epitopes to be 

included in said multi-epitope construct is greater than or equal to X. 
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5. The method of claim 1 wherein said plurality of input parameters further 
includes a maximum number of insertions (Maxlnsertions) value, and said step of 
detemiining for each epitope pair at least one optimum combination of amino acid 

5 insertions comprises calculating a function value (F) for each possible combination of 

insertions for each epitope pair, wherein the number of insertions in a combination is in the 
range of 0 to Maxlnsertions, said function value being calculated in accordance with the 
equation F = (ON)/J, when J > 0, and F = 2(C+N), when J = 0, wherein C equals the 
enhancement weight value of a C+l flanking amino acid, N equals the enhancement weight 
10 value of an N-l flanking amino acid, and J equals the number of junctional epitopes 

detected for each respective combination of insertions in an epitope pair based on said at 
least one motif. 

6. A computerized method for designing a multi-epitope construct having 
1 5 multiple epitopes, the method comprising the steps of: 

storing a plurality of input parameters in a memory of a computer system, said input 
parameters comprising a plurality of epitopes, at least one motif for identifying junctional 
epitopes, a plurality of amino acid insertions, a C+l enhancement weight value for each 
insertion, a N-l enhancement weight value for each insertion, and a maximum number of 
20 insertions (Maxlnsertions); 

generating a list of epitope pairs from said plurality of epitopes; 
for each combination of insertions for each epitope pair, wherein the number of 
insertions is in the range of 0 to Maxlnsertions, calculating a function value (F) using the 
equation F = (C+N)/J, when J > 0, and F = 2(C+N), when J = 0, wherein C equals a C+l 
25 enhancement weight value of a respective flanking amino acid insertion, N equals a N-l 
enhancement weight value of a respective N-l flanking amino acid insertion, and J equals 
the number of junctional epitopes detected for each respective combination of insertions in 
an epitope pair based on said at least one motif; 

determining for each epitope pair at least one optimal combination of insertions 
3 0 yielding a maximum function value F; 

generating a list of optimal combinations of insertions; and 

89 



WO 02/083714 



PCT/US02/09877 



based on said list of optimal combinations of insertions, identifying at least one 
optimum permutation of said multi-epitope construct comprising said plurality of epitopes 
arranged in an order that yields a maximum sum of function values, wherein a respective 
one of said optimal combinations of insertions are inserted at a respective junction of two 
5 epitopes of said multi-epitope construct. 

7. The method of claim 6 wherein said step of identifying at least one optimum 
peimutation comprises performing an exhaustive search wherein all permutations of 
arrangements of said plurality of epitopes are evaluated. 

10 

8. The method of claim 6 wherein said step of identifying at least one optimum 
permutation comprises performing a stochastic search wherein only a subset of all 
permutations of arrangements of said plurality of epitopes are evaluated. 

15 9. The method of claim 8 wherein said plurality of input parameters further 

comprises a maximum search time (MaxSearchTime) value and said stochastic search is 
performed for a period of time approximately equal to said MaxSearchTime value, wherein 
said at least one optimum permutation comprises at least one permutation evaluated as 
having a maximum sum of function values. 

20 

10. The method of claim 6 wherein said step of identifying at least one optimum 
permutation comprises: 

performing an exhaustive search of all permutations of arrangements of said 
plurality of epitopes when the number of epitopes to be included in said multi-epitope 
25 construct is less than a specified value X; and 

performing a stochastic search, wherein only a subset of all permutations of 
arrangements of said plurality of epitopes are evaluated, when the number of epitopes to be 
included in said multi-epitope construct is greater than or equal to X. 

30 1 1 . A computer system for designing a multi-epitope construct having multiple 

epitopes, the system comprising: 
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a memory for storing a plurality of input parameters, said input parameters 
comprising a plurality of epitopes, at least one motif for identifying junctional epitopes, a 
plurality of amino acid insertions and at least one enhancement weight value for each 
insertion; 

5 a processor for retrieving said input parameters from said memory and generating a 

list of epitope pairs from said plurality of epitopes; 

said processor further determining for each of said epitope pairs at least one 
optimum combination of amino acid insertions, based on said at least one motif, said 
plurality of insertions and said at least one enhancement weight value for each insertion; 
10 said processor further identifying at least one optimum arrangement of said plurality 

of epitopes, wherein a respective one of said optimum combinations of amino acid 
insertions are inserted at a respective junction of two epitopes, so as to provide an 
optimized multi-epitope construct; and 

a display monitor, coupled to said processor, for displaying said at least one 
15 optimum arrangement of said plurality of epitopes to a user. 

12. The system of claim 1 1 wherein said processor, when identifying at least 
one optimum arrangement of said plurality of epitopes, performs an exhaustive search 
wherein all permutations of arrangements of said plurality of epitopes are evaluated to 

20 identify at least one optimized multi-epitope construct. 

13. The system of claim 1 1 wherein said processor, when identifying at least 
one optimum arrangement of said plurality of epitopes, performs a stochastic search 
wherein only a subset of all permutations of arrangements of said plurality of epitopes are 

25 evaluated to identify at least one optimized multi-epitope construct. 

14. The system of claim 1 1 wherein said processor, when identifying at least 
one optimum arrangement of said plurality of epitopes, performs an exhaustive search of all 
permutations of arrangements of said plurality of epitopes when the number of epitopes to 

30 be included in said multi-epitope construct is less than a specified value X, and performs a 
stochastic search, wherein only a subset of all permutations of arrangements of said 
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plurality of epitopes are evaluated, when the number of epitopes to be included in said 
multi-epitope construct is greater than or equal to X. 

15. The system of claim 1 1 wherein said plurality of input parameters further 
5 includes a maximum number of insertions (Maxlnsertions) value and said processor, when 
determining for each epitope pair at least one optimum combination of amino acid 
insertions, calculates a function value (F) for each possible combination of insertions for 
each epitope pair, wherein the number of insertions in a combination is in the range of 0 to 
Maxlnsertions, said function value being calculated in accordance with the equation F = 
10 (C+N)/J, when J > 0, and F = 2(C+N), when J = 0, wherein C equals the enhancement 
weight value of a C+l flanking amino acid, N equals the enhancement weight value of an 
N-l flanking amino acid, and J equals the number of junctional epitopes detected for each 
respective combination of insertions in an epitope pair based on said at least one motif. 

15 16. A computer system for designing an optimized multi-epitope construct 

having multiple epitopes, the system comprising: 

an input device for inputting a plurality of input parameters specified by a user; 
a memory, coupled to the input device, for storing said plurality of input 

parameters, said input parameters comprising a plurality of epitopes, at least one motif for 
20 identifying junctional epitopes, a plurality of amino acid insertions, a C+1 enhancement 

weight value for each insertion, a N-l enhancement weight value for each insertion, and a 

maximum number of insertions (Maxlnsertions); 

a processor for retrieving said input parameters from said memory and generating a 

list of epitope pairs from said plurality of epitopes; 
25 wherein said processor, for each combination of insertions for each epitope pair, 

wherein the number of insertions is in the range of 0 to Maxlnsertions, calculates a function 

value (F) using the equation F = (C+N)/J, when J > 0, and F = 2(C+N), when J = 0, wherein 

C equals a C+1 enhancement weight value of a respective flanking amino acid insertion, N 

equals a N-l enhancement weight value of a respective N-l flanking amino acid insertion, 
30 and J equals the number of junctional epitopes detected for each respective combination of 

insertions in an epitope pair based on said at least one motif; and 
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wherein said processor further determines for each epitope pair at least one optimal 
combination of insertions yielding a maximum function value F, generates a list of optimal 
combinations of insertions, and, based on said list of optimal combinations of insertions, 
identifies at least one optimum permutation of said multi-epitope construct comprising said 
5 plurality of epitopes arranged in an order that yields a maximum sum of function values, 
wherein a respective one of said optimal combinations of insertions are inserted at a 
respective junction of two epitopes of said optimized multi-epitope construct. 

17. A data storage device storing a computer program for designing a multi- 
10 epitope construct having multiple epitopes, the computer program, when executed by a 
computer system, performing a process comprising the steps of: 

retrieving a plurality of input parameters from a memory of a computer system, said 
input parameters comprising a plurality of epitopes, at least one motif for identifying 
junctional epitopes, a plurality of amino acid insertions and at least one enhancement 
1 5 weight value for each insertion; 

generating a list of epitope pairs from said plurality of epitopes; 
determining for each of said epitope pairs at least one optimum combination of 
amino acid insertions based on said at least one motif, said plurality of insertions and said 
at least one enhancement weight value for each insertion; and 
20 identifying at least one optimum arrangement of said plurality of epitopes, wherein 

a respective one of said at least one optimum combination of amino acid insertions is 
inserted at a respective junction of two epitopes, so as to provide an optimized multi- 
epitope construct. 

25 18. The data storage device of claim 1 7 wherein said computer program, when 

executed, performs an exhaustive search wherein all permutations of arrangements of said 
plurality of epitopes are evaluated so as to identify said at least one optimum arrangement 
of said plurality of epitopes. 

30 19. The data storage device of claim 1 7 wherein said computer program, when 

executed, performs a stochastic search wherein only a subset of all permutations of 
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arrangements of said plurality of epitopes are evaluated so as to identify said at least one 
optimum arrangement of said plurality of epitopes. 

20. The data storage device of claim 17 wherein said computer program, when 
5 executed, performs an exhaustive search wherein all permutations of arrangements of said 

plurality of epitopes are evaluated, when the number of epitopes to be included in said 
multi-epitope construct is less than a specified value X, and performs a stochastic search, 
wherein only a subset of all permutations are evaluated, when the number of epitopes to be 
included in said multi-epitope construct is greater than or equal to X, so as to identify said 
1 0 at least one optimum arrangement of said plurality of epitopes. 

21. The data storage device of claim 17 wherein said computer program, when 
executed, further retrieves a maximum number of insertions (Maxlnsertions) value from 
said memory of said computer system, and further performs said step of determining for 

15 each epitope pair at least one optimum combination of amino acid insertions comprises by 
calculating a function value (F) for each possible combination of insertions for each epitope 
pair, wherein the number of insertions in a combination is in the range of 0 to 
Maxlnsertions, said function value being calculated in accordance with the equation F = 
(C+N)/J, when J > 0, and F = 2(C+N), when J = 0, wherein C equals the enhancement 

20 weight value of a C+l flanking amino acid, N equals the enhancement weight value of an 
N-l flanking amino acid, and J equals the number of junctional epitopes detected for each ' 
respective combination of insertions in an epitope pair based on said at least one motif. 

22. An apparatus for designing a multi-epitope construct having multiple 
25 epitopes, comprising: 

means for storing a plurality of input parameters in a memory of a computer system, 
said input parameters comprising a plurality of epitopes, at least one motif for identifying 
junctional epitopes, a plurality of amino acid insertions and at least one enhancement 
weight value for each insertion; 
30 means for generating a list of epitope pairs from said plurality of epitopes; 
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means for determining for each of said epitope pairs at least one optimum 
combination of amino acid insertions based on said at least one motif, said plurality of 
insertions and said at least one enhancement weight value for each insertion; and 

means for identifying at least one optimum arrangement of said plurality of 
5 epitopes, wherein a respective one of said at least one optimum combination of amino acid 
insertions is inserted at a respective junction of two epitopes, so as to provide an optimized 
multi-epitope construct. 

23. The apparatus of claim 22 wherein said means for identifying at least one 
10 optimum arrangement comprises means for performing an exhaustive search wherein all 

permutations of arrangements of said plurality of epitopes are evaluated. 

24. The apparatus of claim 22 wherein said means for identifying at least one 
optimum arrangment comprises means performing a stochastic search wherein only a 

15 subset of all permutations of arrangements of said plurality of epitopes are evaluated. 

25. The apparatus of claim 22 wherein said means for identifying at least one 
optimum arrangement comprises: 

means for performing an exhaustive search of all permutations of 
20 arrangements of said plurality of epitopes when the number of epitopes to be included in 
said multi-epitope construct is less than a specified value X; and 

means for performing a stochastic search, wherein only a subset of all 
permutations of arrangements of said plurality of epitopes are evaluated, when the number 
of epitopes to be included in said multi-epitope construct is greater than or equal to X. 

25 

26. The apparatus of claim 22 wherein said plurality of input parameters further 
includes a maximum number of insertions (Maxlnsertions) value, and said means for 
determining for each epitope pair at least one optimum combination of amino acid 
insertions comprises means for calculating a function value (F) for each possible 

30 combination of insertions for each epitope pair, wherein the number of insertions in a 

combination is in the range of 0 to Maxlnsertions, said function value being calculated in 
accordance with the equation F = (C+N)/J, when J > 0, and F = 2(C+N), when J = 0, 
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wherein C equals the enhancement weight value of a C+l flanking amino acid, N equals 
the enhancement weight value of an N-l flanking amino acid, and J equals the number of 
junctional epitopes detected for each respective combination of insertions in an epitope pair 
based on said at least one motif. 

5 

27. A method for designing a multi-epitope construct that comprises two or 
more CTL epitope nucleic acids, the method comprising steps of: 

(i) sorting the CTL epitope nucleic acids to minimize the number of junctional 
epitopes; 

10 (ii) introducing a flanking amino acid residue selected from the group consisting of 

K, R, N, Q, G, A, S, C, and T at a C+l position of a CTL epitope nucleic acids; 

(iii) introducing one or more amino acid spacer residues between two epitope 
nucleic acids, wherein the spacer prevents the occurrence of a CTL or HTL junctional 
epitope; and, 

15 (iv) selecting one or more multi-epitope constructs that have a minimal number of 

junctional epitopes, a minimal number of amino acid spacer residues, and a maximum 
number of K, R, N, G, A, S, C or T at a C+l position relative to each CTL epitope nucleic 
acids. 

20 28. A method for designing a multi-epitope construct that comprises two or 

more HTL epitope nucleic acids, the method comprising steps of: 

(i) sorting said epitope nucleic acids to minimize the number of junctional epitopes; 

(ii) introducing a flanking amino acid residue selected from the group consisting of 
G, P, N or A positioned between said nucleic acid epitopes; and 

25 (iii) introducing one or more amino acid spacer residues between two epitope 

nucleic acids, wherein the spacer prevents the occurrence of an HTL junctional epitope. 

29. The method of claim 27, wherein the spacer residues are independently 
selected from residues that are not known HLA Class II primary anchor residues. 

30 



96 



WO 02/083714 



PCT/US02/09877 



30. The method of claim 27, wherein introducing the spacer residues prevents 
the occurrence of an HTL epitope and further, wherein a spacer comprises at least 5 amino 
acid residues independently selected from the group consisting of G, P and N. 

5 31. The method of claim 30, wherein the spacer is GPGPG. 

32. The method of claim 27, wherein introducing the spacer residues prevents 
the occurrence of an HTL epitope and further, wherein the spacer is 1, 2, 3, 4, 5, 6, 7 or 8 
amino acid residues independently selected from the group consisting of A and G. 

10 

33. The method of claim 27, wherein the flanking residue is introduced at the 
C+l position of a CTL epitope. 

34. The method of claim 27, wherein the flanking residue is selected from the 
1 5 group consisting of K, R, N, G and A. 

35. The method of claim 27, wherein the flanking residue is adjacent to the 
spacer amino acid residues. 

20 36. The method of claim 27, further comprising substituting an N-terminal 

residue of an HLA epitope that is adjacent to a C-terminus of an HLA epitope comprised 
by the multi-epitope construct with a residue selected from the group consisting of K, R, N, 
Gand A. 

25 37. The method of clairu 27, further comprising a step of predicting a structure 

of the multi-epitope construct, and wherein the selecting step further comprises selecting 
one or more multi-epitope constructs that, when introduced into a cell, is processed by an 
HLA processing pathway such that all of the eptiopes included in the multi-epitope 
construct are produced by the HLA processing pathway. 

30 

38. A multi-epitope construct prepared using the method of claim 1 or 27. 
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39. A multi-epitope construct comprising a plurality of CTL epitope nucleic 
acids and a plurality of spacer nucleic acids, wherein: 

the CTL epitope nucleic acids encode class I HLA epitope peptides of about eight to 
about eleven amino acids in length; 
5 the spacer nucleic acids are positioned between the CTL epitope nucleic acids; 

the spacer nucleic acids encode between one and eight amino acids; 
one or more of the spacer nucleic acids encode an amino acid sequence that is 
different than the amino acid sequence encoded by other spacer nucleic acids; and 
each of the spacer nucleic acids optimizes epitope processing and minimizes 
10 junctional epitopes. 

40. The multi-epitope construct of claim 39, further comprising a targeting 
nucleic acid. 

15 41. The multi-epitope construct of claim 39, further comprising a nucleic acid 

sequence encoding a HTL-specific epitope. 

42. The multi-epitope construct of claim 39, wherein two or more of the spacer 
nucleic acids encodes an amino acid sequence that is different than the amino acid 

20 sequence encoded by other spacer nucleic acids. 

43. The multi-epitope construct of claim 39, wherein one or more of the spacer 
nucleic acids encode an amino acid sequence comprising three consecutive alanine 
residues. 

25 

44. The multi-epitope construct of claim 39, wherein the eptiope peptides are 
about nine amino acids in length. 

45. The multi-epitope construct of claim 39, wherein the construct comprises 
30 five or more CTL epitope nucleic acids encoding class I HLA epitope peptides selected 

from the group consisting of FLPSDFFPSV; FLLTRILTI; WLSLLVPFV; GLSRYVARL; 
YMDDWLGV; FLLSLGIHL; ALMPLYACI; STLPETTWRR; HTLWKAGILYK; 
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TLWKAGILYK; LWDFSQFSR; NVSIPWTHK; SAICSWRR; KVGNFTGLY; 
QAFTFSPTYK; LPSDFFPSV; MPSSWAF; TPARVTGGVF; HPAAMPHLL; 
WMMWYWGPSLY; DLLDTASALY; LTFGRETVLEY; LSLDVSAAFY; 
SWPKFAVPNL; RFSWLSLLVPF; EYLVSFGVW; KYTSFPWLL. 

5 

46. The multi-epitope construct of claim 39, wherein the construct comprises 
five or more CTL epitope nucleic acids encoding class I HLA epitope peptides selected 
from the group consisting of FLPSDFFPSV; FLLTRELTI; WLSLLVPFV; FLLSLGIHL; 
YMDDWLGV; STLPETTWRR; HTLWKAGILYK; SAICSWRR; QAFTFSPTYK; 

10 PIPSSWAF; TPARVTGGVF; HPAAMPHLL; FPHCLAFSYM; WMMWYWGPSLY; 
DLLDTASALY; LTFGRETVLEY; LSLDVSAAFY; SWPKFAVPNL; RFSWLSLLVPF; 
EYLVSFGVW; KYTSFPWLL. 

47. The multi-epitope construct of claim 39, wherein the construct comprises 
1 5 five or more CTL epitope nucleic acids encoding class I HLA epitope peptides selected 

from the group consisting of FLPSDFFPSV; FLLTRILTI; WLSLLVPFV; GLSRYVARL; 
YMDDWLGV; FLLSLGIHL; STLPETTWRR; HTLWKAGILYK; LWDFSQFSR; 
NVSIPWTHK; SAICSWRR; QAFTFSPTYK; LPSDFFPSV; BPIPSSWAF; 
TPARVTGGVF; HPAAMPHLL; FPHCLAFSYM; YPALMPLYACI; 
20 WMMWYWGPSLY; DLLDTASALY; LTFGRETVLEY; ASFCGSPY; LSLDVSAAFY; 
ELLLCLIFLL; RWMCLRRFH; SWPKFAVPNL; RFSWLSLLVPF; LWFHISCLTF; 
EYLVSFGVW; KYTSFPWLL. 

48. The multi-epitope construct of claim 39, wherein the construct comprises 
25 five or more CTL epitope nucleic acids encoding class I HLA epitope peptides selected 

from the group consisting of VLAEAMSQV; MTNNPPIPV; MASDFNLPPV; 
KLVGKLNWA; LVGPTPVNI; ILKEPVHGV; KAACWWAGI; KMIGGIGGFI; 
RAMASDFNL; TLNFPISPI; KLTPLCVTL; LLQLTVWGI; SLLNATDIAV; 
LTFGWCFKL; AIIRILQQL; RILQQLLFI; QMAVFIHNFK; KVYLAWVPAHK; 
30 KIQNFRVYYR; AJFQSSMTK; VTIKIGGQLK; TTLFCASDAK; VTVYYGVPVWK; 
QVPLRPMTYK; VMTVWQVDR; QMVHQAISPR; YPLASLRSLF; HPVHAGPIA; 
FPISPIETV; IPYNPQSQGW; EPfflYCAPA; CPKVSFEPI; FPVRPQVPL; VPLQLPPL; 
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EVNIVTDSQY; FRDYVDRFY; VIYQYMDDLY; VTVLDVGDAY; IYQEPFKNL; 

PYNTPVFAI; TYQIYQEPF; YWQATWEW; IWGCSGKLI; RYLKDQQLL; 

VWKEATTTLF; IYETYGDTW; PYNEWTLEL; LLFNELGGWV; FLLLADARV; 

YLVAYQATV; RLIVFPDLGV; DLMGYDPLV; WMNRLIAFA; VLVGGVLAA; 
5 HMWNFISGI; ILAGYGAGV; YLLPKRGPRL; LLFLLLADA; YLVTRHADV; 

KTSERSQPR; RLGVRATRK; QLFTFSPRR; RMYVGGVEHR; LIFCHSKKK; 

GVAGALVAFK; VAGALVAFK; LGFGAYMSK; LPGCSFSDF; LSAFSLHSY; 

CTCGSSDLY; LTDPSHTTA; LTCGFADLMGY; LADGGCSGGAY; FWAKHMWNF; 

RMTLMTHFF; VMGSSYGF; FWAKHMWNFI; FMKAVCVEV; GLLGWSTV; 
10 ILSVSSFLFV; QTNFKSLLR; GVSENIFLK; LLACAGLAYK; TPYAGEPAPF; 

LPSENERGY; KYKLATSVL; SFLFVEALF; YFILVNLLI; FLIFFDLFLV; 

VLAGLLGW; VLLGGVGLVL; LACAGLAYK; ALFFUFNK; FILVNLLIFH; 

LPYGRTNL; FVEALFQEY; FQDEEN1GIY; FYFELVNLL; KYLVIVFLI; GLIMVLSFL; 

KILSVFFLA; VTCGNGIQVR; HVLSHNSYEK; PSDGKCNLY; YYIPHQSSL; 
1 5 KFIKSLFHIF; VFLIFFDLFL; LFHIFDGDNEI; YYGKQENWYSL; LYISFYFI; 

FLPSDFFPSV; FLLTPJLTI ; WLSLLVPFV; GLSRYVARL; YMDDWLGV; 

ILRGTSFVYV; FLLSLGIHL; ALMPLYACI; GLSPTVWLSV; STLPETTWRR; 

HTLWKAGILYK; TLWKAGILYK; LWDFSQFSR; NVSIPWTHK; SAICSWRR; 

KVGNFTGLY; QAFTFSPTYK; LPSDFFPSV; IPIPSSWAF; TPARVTGGVF; 
20 RMSRVTTFTV; ALVLLMLPW; LMIGTAAAW; ALVLLMLPV; GLMTAVYLV; 

MALLRLPV; RMFAANLGV; SLYFGGICV; RLPLVLPAV; RLMIGTAAA; 

FWALIPLV; MTYAAPLFV; AMALLRLPLV; KXCPVQLWV; ATVGMIGV; . 

MIGHLVGV; RLLQETELV; KVAEIVHFL; VVLGWFGI; SMPPPGTRV; 

LTFFWLDRSV; TLMSAMTNL; IMYSAHDTTV; GLPSIPVHPV; LLQERGVAYI; 
25 LLYSLVHNL; MMNDQLMFL; FLTLSVTWIGV; ALGTTCYV; FLTPKKLQCV; 

LLLSIALSV; VLVHPQWVLTV; FLRPRSLQCV; PLVCNGVLQGV. 

49. The multi-epitope construct of claim 39 having the nucleotide sequence 
selected from the group consisting of EP-HIV-1090, HIV-CPT, HTV-FT, HTV-TC, HCV.l, 
30 HCV.2, HCV.3sl, HCV.3s2, HCV.3s2(-3), HCV.3s3, HCV.PC3, HCV.PC4, 

HCV.2431(1P), HCV.4312(1P), AOSLK, HBV.l, HBV.2, PfCTL.1, PfCTL.2, PfCTL.3, 
PQ3, TB.l, BCL A2 #90, BCL A2 #88, Prostate 1, HBV-2A, HBV-2B, HBV-21A, HBV- 
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21B, HBV-30B, HBV-30C, HBV-30CL and a nucleotide sequence that hybridizes to any 
of the foregoing. 

50. The multi-epitope construct of claim 39 having the nucleotide sequence 
5 selected from the group consisting of EP-HIV-1090, HCV.3sl, HCV.3s3, HCV.PC3, 
HCV.PC4, HCV.2431(1P), HCV.4312(1P), HBV.2, PfCTL.l, PfCTL.2, PfCTL.3, Pf33, 
TB. 1, BCL A2 #90, BCL A2 #88, Prostate 1, and a nucleotide sequence that hybridizes to 
any of the foregoing. 

10 51. The multi-epitope construct of claim 39 having the nucleotide sequence of 

EP-HIV-1090 or a nucleotide sequence that hybridizes EP-HIV-1090 under stringent 
conditions. 

52. A multi-epitope construct comprising a plurality of HTL epitope nucleic 
15 acids and a plurality of spacer nucleic acids, wherein: 

the HTL epitope nucleic acids encode class II HLA epitopes of about seven to about 
seventeen amino acids in length; 

the spacer nucleic acids are positioned between the HTL epitope nucleic acids; and 
the spacer nucleic acids encode five or more amino acids, wherein each of the 
20 spacer nucleic acids optimizes epitope processing and minimizes junctional epitopes. 

53 . The multi-epitope construct of claim 52, wherein the spacer nucleic acids 
encode an amino acid sequence having alternating glycines and prolines 

25 54. The multi-epitope construct of claim 53, wherein one or more spacer nucleic 

acids encode the amino acid GPGPG. 

55 . The multi-epitope construct of claim 54, wherein every spacer nucleic acid 
encodes the amino acid GPGPG. 

30 

56. The multi-epitope construct of claim 52, wherein the construct comprises 
five or more HTL epitope nucleic acids encoding class H HLA epitope peptides selected 
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from the group consisting of MGTSFVYVPSALNPAD; LCQVFADATPTGWGL; 
RHYLHTLWKAGILYK; PHHTALRQAILCWGELMTLA; ESRLWDFSQFSRGN; 
PFLLAQFTSAICS W; LVPFVQWFVGLSPTV; LHLYSHPELGFRKI; 
SSNLSWLSLDVSAAF; LQSLTNLLSSNLSWL; AGFFLLTPJLTIPQS; 
5 VSFGVWIRTPPAYRPPNAPI; VGPLTVNEKRRLKLI; KQCFRKXPVNRPIDW; 
AANWILRGTSFVYVP; KQAFTFSPTYKAFLC; AKFVAAWTLKAAA. 

57. The multi-epitope construct of claim 52, wherein the construct comprises 
five or more HTL epitope nucleic acids encoding class II HLA epitope peptides selected 

1 0 from the group consisting of KRWHLGLNKIVRMY; WEFVNTPPLVKLWYQ; 

QKQITKIQNFRVYYR; KVYLAWVPAHKGIGG; GEIYKRWULGLNKI; 

EKVYLAWVPAHKGIG; QHLLQLTVWGIKQLQ; QGQMVHQAISPRTLN; 

SPAIFQSSMTKILEP; DCQFhNMWQEVGKAMY; FRKYTAFTIPSINNE; 

HSNWRAMASDFNLPP; KTAVQMAVFIHNFKR; YRKILRQRKIDRLID; 
15 WAGIKQEFGIPYNPQ; EVNWTDSQYALGII; AETFYVDGAANRETK; 

GAWIQDNSDDCWP; MRKLAILSVSSFLFV; MNYYGKQENWYSLKK; 

SSVFNWNSSIGLIM; RHNWVNHAVPLAMKLI; PDSIQDSLKESRKLN; 

KCNLYADSAWENVKN; VKNVIGPFMKAVCVE; KYKIAGGIAGGLALL; 

GLAYKFWPGAATPY; KSKYKLATSVLAGLL; AGLLGNVSTVLLGGV; 
20 LLIFHINGKIIKNSE; QTNFKSLLRNLGVSE. 

58. The multi-epitope construct of claim 52 having the nucleotide sequence of 
EP-HTV-1043 or a nucleic acid that hybridizes to EP-HTV-1043 under stringent conditions. 

25 59. The multi-epitope construct of claim 52 having the nucleotide sequence of 

EP-HTV-1043 PADRE® or a nucleic acid that hybridizes to EP-HTV-1043 PADRE® under 
stringent conditions. 

60. The multi-epitope construct of claim 52 having the nucleotide sequence of 
30 HTV 75mer or a nucleic acid that hybridizes to HTV 75mer under stringent conditions. 
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61. The multi-epitope construct of claim 52 having the nucleotide sequence of 
PfHTL or a nucleic acid that hybridizes to PfHTL under stringent conditions. 

62. The multi-epitope construct of claim 52 having the nucleotide sequence of 
HBV-HTL or a nucleic acid that hybridizes to HBV-HTL under stringent conditions. 

63. A multi-epitope construct comprising fifteen or more epitope nucleic acids 
and ten or more spacer nucleic acids, wherein: 

the epitope nucleic acids encode class I HLA epitopes or class II HLA epitopes; 

the epitope nucleic acids encode epitope peptides of about seven to about seventeen 
amino acids in length; 

the spacer nucleic acids are positioned between the epitope nucleic acids; and 

the spacer nucleic acids encode between one and eight amino acids when inserted 
between the class I HLA epitope nucleic acids and five or more amino acids when inserted 
between the class II HLA epitope nucleic acids. 

64. The multi-epitope construct of claim 63 having twenty or more epitope 
nucleic acid sequences and fifteen or more spacer nucleic acid sequences. 

65. The multi-epitope construct of claim 63 having twenty-five or more epitope 
nucleic acid sequences and eighteen or more spacer nucleic acid sequences. 
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MaxInsertions= s {enter value here} 208 
OutputToScreen=yes/no 210 
OutputToFiIe=yes/no 212 
MinimumAccepted= {enter value here} 214 
MaxDuplicateFunctionValues={enter value here} 216 
MaxSearchTime (min.)= {enter value here} 218 
Exhaustive=yes/no 220 
NumStochasticProbes= {enter value here} 222 
MaxHitsPerProbe={enter value here} 224 
RandomProbeStart=yes/no 226 
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Junctional Analyzer run on Saturday, February 26, 2000 09:06:23 pm. 
The following non-zero AA weights will be used. 
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The following 1 0 motif specifications will be used to search for junctionals. 
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QutputToScreen = No 
OutputToFile = Yes 
MinlmumValueAccepted = 0 
MaxDuplicateFunctionValues = 50 
SearchTime = 5 
NumStochasticProbes = 10 
MaxHitsPerProbe = 25 
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EP-HIV-1090 

MGMQVQIQSLFLLLLWWGSRGKLVGKLNW^ 

KAKFVAAAVTLKAAAKAFPVTIPQVPLGAAKLTP 

AAAAIFQSSMTKKTTLFCASDAKNIPY^ 

WIHNFKNAAAYPLASLRSLFNLTFGWCT^ 

VTLQLPPLKAMTNNPPIPV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTC 

GGAAAGCTGGTGGGCAAACTCAACTGGGCCGGAGCTGCAATCCTGAAGGAGCCCGTCCACGG 

GGTGAATGCCGCTTGCCCTAAAGTCAGCTTCGAACCAATTAAGATCCCCATTCATTACT 

ACCTGCCAAAGCTAAGTTTGTGGCCGCTTGGACC 

GAGGCCCCAGGTGCCTCTGGGCGCCGCTAAACrCACACCACTGTGCGTCACrCTGGGAGCCGC 

TGCAGTGCTGGCAGAGGCCATGTCCCAAGTGAAGGTGTATCTGGCTTGGGTGCCCGCCCACAA 

GGGGGCCGCTGCAGCCATCTTTCAGTCTAGCATGACCAAGAAAACAACTCTGTTCT 

CGACGCTAAGAACATCCCTTATAATCCACAGTCTCAGGGCGTGGTCAAGCATCCCGTGCACGC 

CGGACCTATTGCTAACGTGACCGTGTACTATGGGGTCCCAGTGTGGAAGAAAGCCGCTGCACA 

GATGGCCGTGTTTATTCACAATTTCAAAAACGCCGCTGCATACCCCCTCGCCAGCCT^ 

CCTCTTCAACCTGACATTCGGCTGGTGCT^ 

ATCAATGCTAAAATCCAGAACTTCCGCGTCTACTATAGGAAGGCTGCAGTGACTATCAAAATT 

GGCGGACAACTGAAGAAAGTGCCTCTCCAGCTGCCCCCTCTCAAGGCAATGACCAACAATCC 

CCCTATCCCAGTCTGA 

HIV-CFr 

MGMQVQIQSLFLLLLWVPGSRGIPIHYCAPAK^ 

TLKAAAKVPLQLPPLKAIFQSSMTK^ 

PQSQGVVKAILKEPVHGVGAAALTC^ 

KVTVYYGVPVWKKAAHPVHAGPIANAAAYPLASLRSLFN^ 

AAAFPVRPQVPLNMTNNPPIPV 

ATGGGGATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 

GGAATCCCCATTCACTACTGCGCCCCTGCTAAGGCAGCCAAAATCCAGAACTTCAGGGTGTAT 

TACAGAAAGGCTGCAGTCACCATTAAAATCGGCGGACAACTGAAGAAAGCCAAGTTTGTGGC 

CGCTTGGACACTCAAGGCCGCTGCAAAGGTCCCACTGCAGCTCCCCCCTCTGAAGGCCATCTT 

CCAGAGCTCCATGACTAAGAAACTGACCCCACTGTGTGTGACACTCGGGGCCCAGATGGCTGT 

GTTCATCCATAATTTTAAAGGCGCCAAGGTCTACCTGGCTTGGGTGCCCGCACACAAGAACG^ 

CATTCCTTACAATCCACAGTCTCAAGGAGTGGTCAAAGCTATTCTGAAGGAGCCCGTGCACGG 

GGTGGGCGCCGCTGCACTCACITTCGGATGGTGCTTTAAACTGAAC^ 

CATGAGCCAGGTCAATCGGATCCTGCAGC^CTC 

GGTGTCCTTCGAGCCAATCAAAGTGACCGTGTATTACGGGGTCCCCGTGTGGAAGAAAGCCGC 

TCATCCTGTCCACGCAGGCCCAATCGCCAACGCCGCTGCATATCCCCTCGCCTCTCTGCGCAG 

CCTGTTTAACGCCGCTGCAACAACCCTCTTTTGCGCCT 

AAAGCTGAACTGGGCCAACGCAGCTGCATTCCCTGTGAGGCCACAGGTCCCCCTCAATATGAC 
TAACAATCCCCCTATCCCAGTGTGA 
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HTV-FT - 

MQVQIQSLFLLLLWWGSRGKLVGKLNW^ 
IMAVFIHNFKIPYNPQSQGWTT^ 
EPIK3QNFRVYYRLTFGWCFKXQWLRP 
CAPAKLTPLCVTL 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCT 

CTGGTGGGGAAGCTGAACTGGGCCATGGCCAGCGATTTCAACCTGCCCCCCGTGGCCATCTTC 

CAGAGCAGCATGACCAAGGTGACCATCAAGATCGGGGGGCAGCTGAAGAGGATCCTGCAGCA 

GCTGCTGTTCATCATGGCCGTGTTCATCCACAACTTCAAGATCCCCTACAACCCCCAGAGCrA 

GGGGGTGGTGACCACCCTGTTCTGCGCCAGCGATGCCAAGATCCTGAAGGAGCCCGTGCACG 

GGGTGCAGATGGCCGTCjTTCATCCACAACTTCAAGGGCGCCGCCGTGTTCATCCACAACTTCA 

AGAGGTGCCCCAAGGTGAGCTTCGAGCCCATCAAGATCCAGAACTTCAGGGTGTACTACAGG 

CTGACCTTCGGGTGGTGCTTCAAGCTGCAGGTGCCCCTGAGGCCCATGACCTACAAGATGACC 

AACAACCCCCCCATCCCCGTGACCGTGTACTACGGGGTGCCCGTGTGGAAGGTGCTGGCCGAG 

GCCATGAGCCAGGTGATCCCCATCCACTACTGCGCCCCCGCCAAGCTGACCCCCCTGTGCGTG 

ACCCTG 
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fflV-TC ■ 

MGMQVQIQSLFLLLLWWGSRGYWQATWIPEWKAff 
PIKHPVHAGPIANLTFGWCFKLNKMGGIGGFIKFJ^ 

NQMVHQAISPRGAKLVGKLNWAGAAAIYETYGDTWKAAQWLRPMTYKGAAAVT 

NAAARYLKDQQLLNTLNmSPIN^ 

GVVKALLQLTVWGIGAAILKEPVHGW 

YQEPFKNLKAAAVLAEAMSQVNLVGPTPVNIGAAAEVN^ 

MDDLYKAAAQMAWIHNFKNAATYQIYQEPFKPYNEWTLELKAK^ 

GAAAIWGCSGKLIKVMIVWQVDRNAAKAACWWAGIK^ 

ASDITCLPPVKSLLNATDIAVNVTVYYG 

RSLF 

ATGGGGATGCAGGTGCAGATCCAGAGCCTGTITCTGCTCCTCCTGTGGGTGCCCGGATCTAGA 

GGATACTGGCAAGCTACTTGGATTCCAGAATGGAAAGCTATCm 

AAGGTATACCTGGCATGGGTCCCAGCACACAAGAACGCCGCIT^ 

CCCATTAAACACCCAGTGCACGCAGGGCCAATAGCGAATTTGACATTCGGGTGGTGCTTCAAA 

CTAAACAAAATGATCGGCGGCATTGGAGGCTTTATCAAGTTTAGAGATTACGTGGACCGAT^ 

TATAAAGCCGCTGCCCGTATACTCCAGCAGCTACTATTCATCAACACCACTCTOT 

CAGACGCTAAGAACCAAATGGTACACCAAGCCATAAGCCCTAGAGGAGCCAAGCTCGTAGGG 

AAATTAAATTGGGCGGGTGCAGCAGCAATCTACGAGACTTACGGCGATACCTGGAAAGCAGC 

CCAGGTTCCGTTACGCCCAATGACCTATAAAGGCGCAGCAGCAGTAACAGTTCTAGATGTAGG 

AGACGCTTACAACGCTGCCGCAAGATACCTAAAAGATCAGCAGCT 

CCCAATTAGCCCGATAAACATGACAAATAACCCACCAATTCCCGTCAATGCTCCCTACAACAC 

TCCAGTATTCGCAATCAAAGCCGCTGCTGTCCCCCTGCAGCTCCCTCCTCT 

ACCITACAACCCACAGAGCCAAGGTC 

TGGTGCTGCAATTCTAAAAGAGCCAGTTCATGGGGTTAACGCCGCCGCCTTCCCAATCAGTCC 

TATTGAGACTGTGAAAGTATGGAAAGAAGCCACAACCACACirTITAAGGCAGCC^ 

CAATTAAAATAGGGGGCCAACTTAAGAAAATATACCAGGAACCTTTCAA 

GCTGCAGTGCTCGCCGAGGCTATGTCACAGGTGAATTTGGTCGGACCAACACCCGTAAACATC 

GGAGCCGCAGCCGAAGTGAACATAGTCACCGACTCACAGTACAAAGCCGCTGCAATACCCAT 

ACATTATTGTGCTCCCGCAAAGGCCGTGATCTATCAATATATGGACGACCTGTATAAGGCCGC 

CGCGCAGATGGCAGTOTTATCCACAACm 

ACCATTCAAACCGTACAATGAGTGGACCTO 

TATATTATAGAAAAGCATTTCCAGTGAGGCCCCAGGTGCCTCTGGGTGCCGCAGCAATATGGG 
GATGTTCTGGAAAACTGATCAAGGTGATGATTGTATGGCAAGTGGACAGAAATGCAGCTAAG 
GCAGCCTGTTGGTGGGCAGGTATAAAAGCAAAGTTCGTGGCAGCATGGACGCTTAAAGCAGC 
CGCAAAACTCACTCCTCTCTGCGTGA 

CCTGTAAAATCCCTGCTTAATGCGACAGATATCGCAGTCAACGTAACAGTATATTATGGCGTG 

CCAGTCTGGAAAAAAGCCGCCGCGGCCATAATTCGGATACTGCAGCAGCTGAAAAGAGCTAT 

GGCGAGTGACITCAACCTGAATGCGGCCGCCTACCCCT^ 

A 
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HCV.l 

MGMQVQIQSLFLLLLWVPGSRGLLFNILGGWVDLMGYIP^ 
DLGVHMWNFISGIYLLPRRGPRLYLWRHADVVLVGGVLA^ 

IAFACTCGSSDLYLSAFSUISYGVAGALVAFKLPGCSFSIFKTSERSQPRLIFCHSKX^ 
OTIPFYGKAIRMYVGGVEHRQLFTFSPRRRLGVRATOKV 

GAATTCGCCGCCACCATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCC 

GGATCCAGAGGACTGCTGTTCAACATCCTGGGGGGGTGGGTGGATCTGATGGGGTACATCCCC 

CTGGTGTACCTGGTGGCCTACCAGGCCACCGTGATCCTGGCCGGGTACGGGGCCGGGGTGAG 

GCTGATCGTGTTCCCCGATCTGGGGGTGCACATGTGGAACTTCATCAGCGGGATCTACCTGCT 

GCCCAGGAGAGGACCTAGACTGTACCTGGTGACTAGACACGCTGATGTGGTGCTGGTGGGAG 

GAGTGCTGGCTGCTCTGCTGTTTCTGCTGCTGGCTGATGCTTTCCT 

AGTGTGGATGAACAGACTGATCGCTTTCGCTTGTACATGTGGAAGCTCCGATCTGTATCTGAG 

CGCTTTCAGCCTGCACAGCTACGGAGTGGCTGGAGCTCTGGTGGCTTTTAAGCT 

TAGCTTTAGCATCTTTAAGACCAGCGAAAGAAGCCAGCCTAGACTGATCnT^ 

GAAGAAGTTTTGGGCTAAGCACATGTGGAATTTTATCCCTTTCTATGGAA^ 

GTATGTGGGAGGAGTGGAACACAGACAGCTGTTTACATTTAGCCCTAGAAGGAGACTGGGAG 

TGAGAGCTACAAGAAAGGTGGGAATCTATCTGCTGCCTAATAGATGAAAGCTTGGG* 



HCV.2 

MGMQVQIQSLFLLLLWVPGSRGDLMGYIPLVAKFVAAWTLKAAALLFLLLADA^ 

TFSPRJRYLVTRHADVYLIJPRRGPRLCTCGSSDLYHMWNHS 

AIIAGYGAGWLVAYQATVGVAGALVAFKIPFYGKAIRMYVGGV^ 

RVLPGCSFSIFAKFVAAWTLKAAAKTSERS 

LHSYLLFNILGGWWGIYLLPNR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGG 
GTGCCCGGATCCAGAGGAGATCTGATGGGATATATCCCTCTGGTGGCTAAGTTTGTGGCTGCT 
TGGACACTGAAGGCTGCTGCTCTGCTGTTTGTGCTGCTGGCTGATGCT 

GCAAGAAGAAGCAGCTGTTTACATTTAGCCCAAGAAGATATCTGGTGACAAGACACGCTGAT 

GTGTATCT GCTGC CTAGACGCGGACCTAGACTGTGTACATGTGGAAGCTCCGATCTGTATCAC 

ATGTGGAACirTATCAGCGGAATCTTTTGGGCTAAGCACATGTGGAATTT 

TATGGAGCTGGAGTGTATCTGGTGGCTTATCAGGCTACAGTGGGAGTGGCTGGAGCTCTGGTG 

GCTTTCAAGATCCCATTCTATGGAAAGGCTATCAGAATGTATGTGGGAGGAGTGGAACACAG 

AGTGCTGGTGGGAGGAGTGCTGGCrGCTTTCCTGCTGCTGGCTGATGCTAGAGTGCTGCCAGG 

ATGTAGCTTTAGCATCTTCAAGACTTCCGAACGCTCCCAGCCTAGAAGACTGGGAGTGAGAGC 

TACAAGGAAGAGACTGATCGTGTTTCCAGATCTGGGAGTGTGGATGAATAGACTGATCGCTTT 

CGCTCTGAGCGCTTTCAGCCTGCACAGCTATCTGCTGTTCAACATCCTGGGAGGATGGGTGGT 

GG GAATCTATCTG CTGCC AAAC AG ATG AAAGCTT 

HCV.3sl 

MGMQVQIQSLFLLLLWWGSRGYLVAYQATV^ 
VTRHADVLGFGAYMSKCTCGSSDLYHMWNHSGIFWAKHMWNF* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGG 

GTGCCCGGATCCAGAGGATACCTCGTCGCCTACCAGGCCACTGTGGCTAAATTCGTGGCAGCC 

TGGACACTGAAAGCTGCAGCTCTGCTCTrCCTGCTCCTGGCCGATGCACTCATCTTCT 

CCAAGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCTGGGGTTTGGCGCCTACATGAGC 

AAGTGCACCTGTGGCAGCTCCGACCTGTATCA 

CCAAGCACATGTGGAATTTCTGAAAGCTT 
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HCV.3s2 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVL 
AGYGAGVWMNRLIAFAIPFYGKAIVAGALVAFKVGIYLLPNR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCT 

GTGCCCGGATCCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCCGCTGCTAAGTTTGTCGCTGCT 
TGGACACTGAAGGCAGCCGCnTTCCTGCTCCTGGCAGACG 

CTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGTGGATGAATCGGCTGATCGCCTTT 
GCCATTCCATTCTATGGGAAAGCCATTGTGGCTGGCGCCCTGGTGGCATTCAAGGTCGGGATC 
TACCTCCTGCCTAACCGCTGAAAGCTT 

HCV.3s2(-3) 

MGMQVQIQSLFLLLLWWGSRGVLVGGVLAAAKFVAAWTI^^ 
AGYGAGVWMNRLIAFA* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGG 
GTGCCCGGATCCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCCGCTGCTAAGTTTGTCGCTGCT 
TGGACACTGAAGGCAGCCGCTTTCCTGCTXOT 

CTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGTGGATGAATCGGCTGATCGCCTTT 
GCCTGAGGATCC 

HCV3s3 

MGMQVQIQSLFLLLLWWGSRGDIMGYIPLVAKFVTA^ 
RMYVGGVEHRRLIWPDLGVGVAGALVAF^ 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCT 

GTGCCCGGATCCAGAGGAGACCTGATGGGCTACATCCCTCTCGTGGCCAAGTTTGTGGCAGCT 

TGGACCCTGAAGGCCGCTGCCAGACTGGGAGTGCGCGCTACACGGAAACTCCTGTTTAA 

CTGGGAGGGTGGGTGCGGATGTACGTCGGAGGCGTCGAGCACAGAAGGCTCATTGTCTTTCC 

AGATCTCGGCGTGGGCGTCGCAGGCGCACTCGTGGCCTTCAAACTGCCAGGGTGCAGCTTCAG 

CATTTTCAAGACCTCCGAACGCTCCCAArc^ 

CTGCTGCCCAGACGCGGACCCAGGCTGTGAAAGCTT 

HCV.PC3 

MGMQVQIQSLFLLLLWVPGSRGLLFMLGGWVKAKFVAAWTLKAAALADG 
LGVKFSVAKHMWNFIGVAGALVAFKKQLFTFSPRR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGG 

GTGCCCGGATCCAGAGGACTGCTCTTCAACATCCTGGGCGGATGGGTGAAGGCCAAGTTCGTG 

GCTGCCTGGACCCTGAAGGCTGCCGCTCTGGCCGACGGGGGATGCAGCGGCGGAGCTTACAG 

GCTCATTGTCTTTCCCGATCTCGGAGTCAAATTTTGGGCAAAGCACATGTGGAAm 

GGTGGCCGGAGCCCTGGTCGCTTTTAAAAAGCAGCTCTTCACCI^ 

TACC 
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HCV.PC4 

MGMQVQIQSLFLLLLWWGSRGf^^ 

DLMGYIPLVKYLLPRRGPRLWLCGFADmGYRMYVGGVEHR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGG 

GTGCCCGGATCCAGAGGAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCCAAGTTCGTGGC 

TGCCTGGACCCTGAAGGCTGCCGCTAAAACAAGCGAGCGCTCCCAGCCCAGGAACCTGCCTG 

GATGCTCTTTCAGCATCTTTAATGACCTCATGGGGTACATTCCACTGGTGAAGTATCTGCT 

CAGACGGGGCCCTCGCCTGAACACTCTCTGTGGATTTGCTGATCTGATGGGGTACAGGATGTA 

TGTCGGCGGAGTCGAACACAGATGAGGTACC 

HCV.243U1P^ 

MGMQVQIQSLFLLLLWWGSRGVLVGGV^^ 

IAFAGAAARLGVRATRKKAAAKTSERSQPRl^ 

FADLMGYRNfYVGGVEHRJKXLFNILGGWVKAAA^ 

HGVAGALVAFKKQLFITSPRRNGYLVAYQATVAAALLFLLLADALIFCHSKKKYLV^ 
FGAYMSKCTCGSSDLYHMWNFISGWWAJ&m^ 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGT1TCTGCTCCTCCTCT 

GTGCCCGGCTCCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCAGCCGCTTTCCTGCTCCTGGCA 

GACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGC 

GTGTGGATGAATCGGCTGATCGCCTTTGCCGGCGCTGCCGCAAGGCTGGGCGTGAGAGCCACC 

CGGAAGAAGGCTGCCGCTAAAACAAGCGAGCGCTCCCAGCCCAGGAACCTGCCTGGATGCTC 

TTTCAGCATCTTTAATGACCTCATGGGGTACATTCCACTGGTGAAGTATCTGCTCCCCAGACGG 

GGCCCTCGGCTGAACACTCTCTGTGGATTTGCTGATCTGATGGGGTACAGGATGTATGTCGGC 

GGAGTCGAACACAGAAAACTGCTCTTCAACATCCTGGGCGGATGGGTGAAGGCTGCCGCTCT 

GGCCG ACGGGGGATGCAGCGGCGGAGCTTACAGGCTCATTGTCTTTCCCGATCTCGGAGTCAA 

ATTTTGGGCAAAGCACATGTGGAATTTCATCGGGGTGGCCGGAGCCCT 

GCAGCTCTTCACCTTCTCCCCAAGACGGAACGGATACCTCGTCGCCTACCAGGCCACTGTGGC 

TGCAGCTCTGCTCITCCTGCrCCTGGCCGATGCACTCATCTTCrGCCATTCCAAG 

CTGGTCACCAGACATGCTGACGTGCTGGGGTTTGGCGCCTACATGAGCAAGTGCACCTGTGGC 

AG CTCC GACCTGTATCACATGTGGAACTTTAITrCTGGAATCTTTTGGGCCA^ 

AATTTTAAGGCCGCAGCAGCTAAATTCGTGGCAGCCTGGACACTGAAAGCAGCTGCATGAGG 

ATCC 
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HCV.4312flP) 

MGMQVQIQSLFLLLLWVPGSRGRLGVRATRKKAAAK^^ 

YLLPRRGPRLNTLCGFADLMGYRMYVGGVEHRKLLFNILGGW 

DLGVKFWAKHMWNFIGVAGALVAFKKQLFTFSPRRN^ 

KKKYLVTRHADVLGFGAYMSKCTCGSSDLYHMWNFISGIFW 

AFLLLADARVLSAFSLHSYILAGYGAGVWM^ 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGG 

GTGCCCGGCTCCAGAGGAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCTGCCGCTAAAAC 

AAGCGAGCGCTCCCAGCCCAGGAACCTGCCTGGATGCTCTITCAGCATCITrAATG 

GGGGTACATTCCACTGGTGAAGTATCTGCTCCCCAGACGGGGCCCTCGCCTGAACACTCTCTG 

TGGATTTGCTGATCTGATGGGGTACAGGATGTATGTCGGCGGAGTCGAACACAGAAAACTGCT 

CTTCAACATCCTGGGCGGATGGGTGAAGGCTGCCGCTCTGGCCGACGGGGGATGCAGCGGCG 

GAGCTTACAGGCTCATTGTCTTTC^ 

ATTTCATCGGGGTGGCCGGAGCCCTGGTC^ 

GACGGAACGGATACCTCGTCGCCTACCAGGCCACTGTGGCTGCAGCTCTGCTCTTCCTGCT 
TGGCCGATGCACTCATCTTCTGCCATTCCAAGAAAAAGTATCTGGTCACCAGACATGCTGACG 
TGCTGGGGTTTGGCGCCTACATGAGCAAGTGCACCTGTGGCAGCTCCGACCTGTATCACATGT 
GGAACTTTATTTCTGGAATCITITGGGCCAAGCACATGTGG 

TCCTGGTGGGCGGCGTCCTGGCAGCCGCTTTCCTGCTCCTGGCAGACGCCAGGGTGCT 
CCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGTGGATGAATCGGCTGA 
TCGCCTTTGCCAATGCTGCAGCTAAATTCGTGGCAGCCTGGACACTGAAAGCAGCTGCATGAG 
GATCC 



AOSI.K 

MGMQVQIQSLFLLLLWWGSRGHTLWK^ 
LYMDDVVLGVGIi>RYVARLFLLTRILT^ 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTITCTGCTCCrCCTGTGGGTGCCCGGGTCCAGA 
GGACACACCCTGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTG 
AAGGCTGCCGCnTTCCTGCCTAGCGATTTCTTTCCTAGCG 

TCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGT 
TCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGG 
CCTTCACCTTTAGCCCTACCTATAAGTGGCrGAGCCTGCTGGTGCCCTTTGTGTGA 

HBV.l 

MGMQVQIQSIJLLLLWWGSRGHTLW^^ 
YMDDVVLGVGLSRYVARLFLLTRILTISTLPETIVVRRQAFITS^ 
ARVTGGVFKVGMnTGLYLPSDFFPSVTLWKAGILYKNV 
MPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGA 

GGACACACCCTGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTG 

AAGGCTGCCGCTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGTTCCTGCT 

ACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGTTCC 

TGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGGCCT 

TCACCTITAGCCCTACCTATAAGTGGCTGAGCCTGCT 

CTCCTGGGCTTTCACCCCAGCCAGGGTGACCGGAGGAGTGTTTAAGGTGGGAAACTTCACCGG 
CCTGTATCTGCCCAGCGATTTCTTTCCTAGCGTGACCCTGTGGAAGGCCGGGATCCTGTACAA 
GAATGTGTCCATCCCITGGACCCACAAGCTGGTGGTGGACTTTTCCCAGTTCAGCAGATCCGC 
TATCTGCTCCGTGGTGAGGAGAGCTCTGATGCCACTGTATGCCTGTATCTGA 
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HBV.2 • - 

MGMQVQIQSLFLLLLWWGSRGHTLWKAGILYKAKFVAAWTLKAAAFLP 
LYMDDWLGVGLSRYVARLFLLTRILTISTIJPETTWRRQAFTF 
SSWAFKTPARVTGGVFKVGNFTGLYNLPSDFFPSVKTLWKAGILYKNV 
FSRNSAICSWRRALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGA 
GGACACACCCTGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTG 
AAGGCTGCCGCTTTCCTGCCTAGCGATTTCnriTCCTAGCGTGAA 

TCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGT 

TCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGG 

CCITCACCTTTAGCCCTACCrATAAGGGAGCCGCTGCCTGGCTGAGCCTGCT 

GAATATCCCTATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGAGTGTT 

TAAGGTGGGAAACTTCACCGGCCTGTATMCCTC^ 

CCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGGGAGCCG 
CTCTGGTGGTGGACTTTTCCCAGTTCAGCAGAAATTCCGCTATCTGCTCCGTGGTGAGGAGAG 
CTCTGATGCCACTGTATGCCTGTATCTGA 

PfCTL.l 

MQVQIQSLFLLLLWWGSRGD^VSSFLFW^ 

KKAAAAKFVAAWTLKAAAKAFMKAVCVEVNAAASFLFVEAL 

TSVLKAGVSENIFLKNAAAYFILVNLL1KAGLLGVVSTV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 
GGAATCCTGAGCGTGTCCTCITrCCTGTTTGTCAACGCCGCTGCACAG 

CTCCTGAGGAACCTCCCCTCCGAGAACGAAAGAGGCTACAAAGCCGCTGCACTGCTCGCCTGC 
GCTGGACTGGCCTATAAGAAAGCCGCTGCA^ 

CGCTGCAAAAG CCTTTATG AAGG CTGTCTGTGTGGAGGTC AATGCCCKIITGCATCTTTCCTGTTT 

GTGGAGGCCCTCnTTAACGCTACTCCm'ACGCAGGGGAACCAGCCCCCTTCAAGGCCGCT 

AAATATAAGCTGGCAACCAGCGTGCTGAAGGCTGGCGTGTCCGAGAATATTTTTCT 

GCCGCTGCATACITCATCCTGGTGAATCTGCrCATTAAGGCCGGACTCCTG 

ACAGTGTGA 

PfCTL.2 

MQVQIQSLFLLLLWWGSRGFVEALFQEYN^ 

ALFFIIFNKNAAAKJFVAAWTIi^ 

GLVLNFLIFFDLFLVKAVLAGLLGW 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 

GGATTCGTGGAGGCCCTGTTTCAGGAATACAACGCCGCTGCAAAGTATCTCGTCATCGTGTTC 

CTGATCAATGCTCTGGG\TGCGCCGGCCT^^ 

TGCTCAAGGCCGCTCTGTrCTTTATCATTTrCAATAAAAACGCC^ 

ATGGACCCTGAAGGCCGCTGCAAAATTCATCCTCGTGA^ 

GACGAGGAAAATATCGGAATTTATAAGCTGCCCTACGGGAGGACAAACCTGAAAGCCGCTGC 

AGTCCTGCTCGGCGGAGTGGGGCTGGTGCTCAATTTTCTGATCITCTTT 

AAGGCCGTCCTGGCCGGCCTGCTCGGAGTCGTGTGA 



FIGURE 18H 



WO 02/083714 



31/60 



PCT/US02/09877 



PfCTL.3 . 

MQVQIQSLFLLLLWVPGSRGWLIFFDLFLNAA^ 

NEIKAHVLSHNSYEKNYYGKQEN\VTSLKXILSW 

VAAWTLKAAAKAAAYYIPHQSSLKAAAGLIMVLSFL 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 
GGAGTGTTCCTGATCTTCTTTGACCTG^ 

AATCTCTACAAGGCCGCTGCAGTGACCTGTGGAAACGGGATTCAGGTCAGGAAACTC 
ATCTTCGACGGCGATAACGAGATCAAGGCCCATGTGCTGTC 

TACTATGGAAAGCAAGAGAATTGGTACAGCCTGAAGAAAATTCTGTCCGTGTTCm 

AACGCC GCTGC AAAGTTTATCAAGTCrCTGTTCCATATTTTCAAGGCCG 

GCTTCTATTTTATTAAAGCCAAATTTGTGG 

CTGC ATACTATATCCCTCACCAGAGCTCCCTGAAGGCCGCTGCAGGGCTGATCATGGTGCTCT 
CTTTCCTGTGA 

PfCTIVHTLfN) 

MQVQIQSIJLLLLWWGSRGSSVFNVWSSIGLIMVLSFLG^ 

SEGPGPGPDSIQDSLKESRKLSGPGPGVLAGLLGWSTVLLGGVGLVLGPGPGLPSENERGYYIPHQ 
SSLGPGPGQTOTKSLIJRNLGVSENIFIJ£^ 

GKJFIKSLFHIFDGDNEIGPGPGKSKYKLATS\aAGLIXjPGPGIJPY 

AMKLIGPGPGMRKXAII^VSSFLFV^BAIJFQEYGPGPGVTCGNGIQVRGPGPGMNYYGKQE 
KKGPGPGPSDGKCNLYADSAWEm'KmaGPFA^ 

HVLSHNSYEKGPGPGKYKIAGGIAGGLALLACAGIAYXFVVPGAATPYAGEPAPF 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 
GGAAGTAGTGTGTTCAATGTTGTGAACTCATC^ 

GGCCAGGGCCAGGATTATATATTTCTTTCTACTTCATCCTrGTCAACCTG 

TAACGGCAAAATAATAAAGAACAGTGAAGGCCCTGGGCCTGGGCCTGACTCGATCCAGGATT 

CTCTAAAAGAATCGAGGAAGCTCTCCGGACCAGGCCCTGGTGTACTCGCCGGGTTGCTGGGA 

GTAGTTAGCACAGTGCTGTTAGGAGGCGTCGGCCTCGTCTTAGGACCTGGACCAGGTCTGCCG 

TCCGAAAACGAAAGAGGATACTACATACCTCACCAGAGCAGCCTCGGCCCAGGCCCCGGACA 

AACCAATTTCAAATCCCTCnTGCGAAATCTAGGAGTGAGCGAGAACATATTTCIT 

CGGTCCCGGCnTTCAGGACGAGGAGAATATAGGTATTTACGGTCCAGGACCTGGAAAATACCT 

AGTGATCGTATTCCTAAllllirilGACCTATTTCTGGTGGGCCCAGGTCCCGGAAAGTTCATT 

AAATCACTCTTCCACATTTTTGACGGAGATAACGAGATAGGACCCGGTCCCGGGAAATCA 

GTACAAACTAGCCACTTCAGTGCTGGCCGGCCTTCTAGGGCCGGGCCCAGGGCTCCCCTATGG 

AAAGACAAATCTTGGCCCCGGTCCAGGACGGCACAACTGGGTGAATCATGCGGTTCCATTGG 

CCATGAAACTAATCGGGCCCGGTCCAGGCATGCGCAAACITGCAATTCTAAGCGTAAGTTCAT 

TTCTGTTCGTAGAGGCACTGTTTCAAGAATATGGCCCAGGACCTGGCGTCACATGTGGGAATG 

GGATCCAGGTGAGAGGACCGGGACCTGGTATGAACTATTACGGTAAACAGGAAAATTGGTAC 

TCCCTGAAAAAGGGTCCAGGCCCCGGCCCCTCAGATGGTAAGTGCAACCTGTATGCTGACTCA 

GCATGGGAGAACGTAAAAAATGTAATAGGCCCATTCATGAAGGCAGTTTGTGTCGAAGTCGG 

ACCAGGCCCAGGAAAAATACTTTCTGTCITCTTCCTAGCTCT 

GGACCAGGGCCAGGTCACGTGTTATCCCATAACTCITATGAAAAAGGGCCAGGACCTGGG 
ATACAAAATCGCAGGAGGGATCGCCGGCGGGCTAGCGCTCCTTGCCTGCGCAGGCTTGGCTTA 
CAAATTCGTTGTACCAGGAGCTGCAACACCCTATGCAGGAGAACCTGCCCCATTTTGAAGATC 
TGC 



FIGURE 181 



WO 02/083714 



32/60 • 



PCT/US02/09877 



Pf33 

MGMQVQIQSDFLLLLWVPGSRGFMKAV^^ 

EIKAALLACAGLAYKICSFLFVEALFNAAPSDGKChfiLYKAAQTNFKS 

EMFLKNAAAYFILVNIJLJKAAAJLSVSSF^ 

FLNYYIPHQSSUCAAGLLGWSWGAVLLGGVGLVLNLACAGL^ 

VNLLKAFLIFFDLFLVKAI^ 

ILSVFFIJWAVLAGLLGNVNFQDEENIGIYKAAALYISFYFIKAFI^ 
HVLSHNSYEKNAAAKYLVIVFLr 

GCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGT 

GGATCCAGAGGATTTATGAAAGCTGTCTGTGTAGAGGTGAATGTAACATGCGGTAACGGAAT 

TCAGGTGAGAAAGGGACTCATCATGGTACTCAGCTTTCTGAACGCAGCCCTGTTCCACATCTT 

TGACGGAGACAATGAAATCAAAGCCGCATTGCTCGCCTGTGCCGGACTAGCCTATAAAAAGA 

GTTTCCTTTTCGTTGAAGCACTATTTAACGCAGCACCCAGTGACGGTAAATGCAACCT 

AAGCAGCTCAGACTAATTTCAAAAGCCTGTTAAGAAATCTGCCCTCAGAGAATG 

TACAAAGCCGCCGGCGTGTCCGAGAATATTTTC^ 

GTGAATCTACTCATAAAGGCAGCCGCAATCCTTTCAGTGTCCAGCTTTCTGm 

CATATGCGGGCGAGCCGGCTCCTTTCAAGGCTGCAGCAAAATACAAGCTTGCCACATCAGTAT 

TGAAAGCAGCTGTGTTTTTGATATTCTTTGATCrri'l 'J Tl AAACTACTACATACCTCATCAGTCT 

AGTCTTAAAGCAGCCGGGCTACTGGGGAACGTCTCTACTGTGG 

GTTGGCCrCGTGTTGAACCTCGCGTGCGCAGGTCrGGCCTACAAAAAAGCGA 

TCTCTGTTCCACATrTTTAAAGC^^ 

GATCITC1TCGATCTATTCCTCGTAAAAGCGCTATTCTTCATT 

GGCAAGCAAGAAAATTGGTACTCACTCAAGTITGTAGAAGCTCTGTTCCAGGAATACAACGCC 
GCTGCTAAATTCGTTGCAGCTTGGACCCTGAAAGC^^ 

TCGCTAATGCCGTATTAGCAGGACTTCTAGGCAACGTGAACTTTCAAGACGAAGAGAATATAG 
GCATCTACAAAGCCGCAGCACTGTACATTTCATTCTACTC 

CCTTCTGATATTTCATAATGCAGCACTGCCATATGGGAGAACCAACTTGAAAGCGGCCCACGT 
GTTGAGCCACAACTCCTACGAGAAGAACGCCGCCGCGAAATATCrCGTCATTGTCTTCCrGAT 
TTGA 



TB.l 

MQVQIQSLFLLLLWWGSRGRMSRVTTFIYKALVLLMLPV^ 

GLMTAVYLVGAAAMALLRLPVK^^ 

IJCAAAKAAARLMIGTAAAGFWALIPLVNAMTYAAPL^ 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGG 

ATGAGCAGAGTGACCACATTCACTGTCAAGGCCCTGGTGCTCCTGATGCTCCCCGTCGTGAAC 

CTGATGATCGGCACCGCTGCAGCCGTCGTGAAAGCTCTCGTCCTGCTCATGCTCCCTGTGGGA 

GCAGGGCTGATGACAGCCGTGTACCTGGTCGGCGCTGCAGCCATGGCCCTCCTGCGGCTGCCA 

GTGAAGCGCATGTTTGCTGCAAATCTGGGAGTCAACTCCCTCTATTTCGGGGGCATTTGCGTG 

GGAAGGCTGCCCCTCGTGCTGCCTGCTGTGAATGC 

ACTCTGAAGGCAGCCGCTAAGGCCGCTGCAAGACTGATGATCGGGACCGCCGCTGCCGGCTT 
CGTGGTCGCCCTGATTCCCCTGGTGAACGCCATGACATACGCAGCTCCTCTGTTTGTGGGAGC 
CGCTGCAGCCATGGCTCTCCTGCGGCTGCCACTGGTGTGA 
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BCL A2 #90 

MQVQIQSLFLLLLWWGSRGIMIGHLVGVNRLLQETELVNAKVAEIVHFLNAKVFGSLAFW 

S G ANLNVG AA YLQLVF GIE VN A AAKF V AAWTLKAAAKAAA V VLG VVFGBSI SMPPP GTRVN AAA 

ATVGIMIGVNAKLCPVQLWV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAATT 

ATGATCGGCCATCTGGTGGGCGTCAACAGACTGCTGCAGGAAACCGAGCTGGTGAATGCCAA 

GGTGGCCGAAATTGTGCACTTTCTCAACGCAAAGGTGTTTGGTTCCCTGGCTl^ 

TATCTGAGCGGCGCTAACCTCAACGTCGGAGCCGCCTACCTCCAGCTGGTCTTCGGCATCGAG 

GTCAACGCTGCTGCAAAATTCGTGGCAGCTTGGACCCTCAAGGCTGCAGCAAAGGCTGCCGCC 

GTCGTGCTCGGAGTGGTGTTCGGGATCAACTCTATGCCACCTCCCGGGACTAGGGTCAATGCT 

GCCGCCGCAACAGTGGGAAT(^TGATTGGGGTGAATGCCAAACTGTGCCCAGTGCAACTGTG 

GGTGTGA 

BCL A2 #88 

MQVQIQSLFLLLLWVPGSRGVVLGVWGIN^ 

NVGAAYLQLWGIEVNIMIGHLVGVNRLLQETELVNAKWGSLAFVNAF^ 
GMIGVNSMPPPGTRV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAGTC 

GTGCTGGGAGTCGTOTCGGCATTAATGCCGCCGCTGCAAAGTTCGTGGCTGCCTGGACCCTG 

AAGGCCGCAGCTAAAGTGGCAGAGATCGTGCACTTTCTGAACGCCTACCTGAGCGGAGCAAA 

TCTGAACGTCGGCGCTGCCTATCTGCAGCTCGTGTTTGGAATTGAAGTGAACATCAT 

ACATCTGGTGGGCGTGAACAGGCTGCTCCAGGAAACTGAGCTGGTCAACGCTAAAGTGTTCG 

GGTCTCTCGCCTTTGTGAACGCTAAGCTCTGCCCCGTCCAACTCTGGGTCAATGCCGCAGCCG 

CTACAGTGGGGATCATGATCGGCGTGAACTCCATGCCTCCACCAGGGACCAGAGTGTGA 

BCL A2 #63 

MQVQIQSIJLLLLWVPGSRGKLCPVQLWWA 

KVAEIVHFLNAKWGSLAFVNAYLSGANLNVGAAYLQLWGffiVNAAAKFVAA 
AWLGWFGINSMPPPGTRV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAG 

CTCTGCCCCGTGCAACTGTGGGTCAACGCCGCCGCCGCAACCGTCGGCATTATGATCGGGGTG 

AACATCATGATCGGACACCTGGTCGGCGTGAACAGGCTGCTGCAGGAGACAGAACTGGTCAA 

TGCCAAGGTGGCTGAAATTGTCCATTTCCTGAATGCCAAAGTGTTCGGCTCTCT 

AACGCITATCTGAGCGGAGCTAACCTCAACGTGGGGGCCGCATACCTC 

ATTGAGGTGAATGCCGCAGCTAAATTTGTCGCTGCCTGGACCCTGAAGGCAGCAGCCAAGGCT 
GCCGCAGTGGTGCTGGGAGTGGTGTTTGGAATCAATTCCATGCCTCCACCAGGCACTAGAGTG 
TGAGGATCC 
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Prostate ! . 

LTFFWLDRSVKAAAVLVHPQWVLTVK^ 

AAIMYSAHDTTVKAAAFLTPKKLQCW 

LLWQPIPVOTLRPRSLQCVKAFLTI^VTWIGVK 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATTG 
ACATTTTTTTGGCTGGATAGATCGGTTAAGGCT 

TGACCGTAAAGGCTGCCGCGCTGCTACAAGAAAGAGGGGTCGCATACATCAAAGCTGCTCTC 
CTCTTGAGTATTGCGCTAAGTGTAAACCCG 

GCGGCGATTATGTACAGTGCCCACGACACTACCGTAAAAGCAGCCGCTTTCCTGACCCCAAAA 
AAACTCCAATGCGTGAACGCAATGATGAATGATCAGCrGATGTTTTTAAA 
TCTATACCGGTTCATCCAGTCAAGGCCGCGGCATTGGGTACGACGTGTTATGTTGGAGCAGCG 
ATACTTCTTTGGCAGCCCATACCAGTAAA 

CATTCCITACACTCTCAGTAACTTGGATCGGAGTCAATGCTCTGCTATATA 

CTTGGGCGCGGCCACACTrATGAGTGCAATGACGAATTTAGCTAAGTTCGTGGCGG 

TCTAAAGGCCGCAGCA 

HIV-1043 

MEKVYLAWWAHKGIGGGPGPGQKQITKIQOT 
KILRQRKIDRLIDGPGPGQHLLQLTV^^ 

VHQAISPRTLNGPGPGIKQFINMWQEVGKAMYGPGPGWAGIKQEFGIPYOTQGPGPGKTAV 

WIHOTKRGPGPGSPAIFQSSMTKILEPGPGPGEVNIVTDS^^ 

GPGPGAETFVVDGAANRETKGPGPGGAVVIQ 

ATGGAGAAGGTGTACCTGGCCTGGGTTCCAGCCCACAAAGGCATCGGGGGAGGGCCCGGACC 

TGGGCAGAAACAGATCACCAAGATCCAGAACTTCCGGGTATACTACCGGGGACCTGGTCCAG 

GTTGGGAGTTTGTGAACACACCACCCTrAGTAAAGCTCTGGTACCAGGGCCCCGGTCCCGGAT 

ACCGTAAAATCCTGAGGCAAAGAAAGATAGATCGCCTCATTGATGGCCCGGGCCCAGGCCAG 

CACCTTCTGCAGCITACAGTGTGGGGAATTAAACAGCTGCAGGGGCCGGGCCCCGGGGGGGA 

AATTTATAAAAGGTGGATCATTCTGGGTCTGAACAAGATCGTCCGCATGTATGGCCCTGGACC 

CGGACAGGGGCAGATGGTCCACCAAGCAATCAGCCCTCGAACCTTGAATGGACCGGGCCCAG 

GAATCAAGCAATTCATTAACATGTGGCAAGAAGTTGGTAACKjCTATGTACGGTCCCGGCCCT 

GATGGGCAGGGATAAAACAGGAGTTTGGAATCCCTTACAATCCCCAGGGTCCTGGGCCAGGT 

AAAACGGCAGTGCAGATGGCCGTGTTCATTCATAATTITAAGCGGGGCCCTGGACCTGGCAGC 

CCAGCTATATTTCAAAGTTCGATGACCAAAATCTTGGAGCCCGGCCCAGGGCCGGGCGAAGT 

GAACATTGTCACAGATTCTCAGTATGCCCTCGGCATCATAGGGCCCGGACCAGGGCATTCCAA 

TTGGCGCGCCATGGCGTCTGACTITAATCrACCTCCTGGGCCAGGCCCTGG 

CTATGTGGACGGCGCTGCAAACAGGGAGACTAAGGGACCCGGACCCGGCGGCGCTGTAGTCA 

TTCAGGACAACTCAGACATCAAGGTGGTTCCCGGTCCAGGCCCCGGGTTCAGAAAGTATACCG 

CCTTCACTATTCCGTCCATCAACAATGAGTGA 
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HIV-1043 PADRE 

MEKVYLAWWAHKGIGGGPGPGQKQITKIQ 
KJLRQRKIDRLIDGPGPGQHLLQLTVWGIKQLQGPGP 

VHQAISPRTLNGPGPGIKQFIhMWQEVGKAMYGPGPGWAGIKQEFGIPYNPQGPGPGKTAVQMA 

VHHNFKRGPGPGSPAIFQSSMTKILEPGPGPGEVMVTO 

GPGPGAETFYVDGAA>fl^TKGPGPGGA\^QDN^^ 

KFVAAWTLKAAA 

ATGGAGAAGGTGTACCTGGCCTGGGTTCCAGCCCACAAAGGCATCGGGGGAGGGCCCGGACC 

TGGGCAGAAACAGATCACCAAGATCCAGAACTTCCGGGTATACTACCGGGGACCTGGTCCAG 

GTTGGGAGTTTGTGAACACACCACCCTTAGTAAAGCTCTGGTACCAGGGCCCCGGTCCCGGAT 

ACCGTAAAATCCTGAGGCAAAGAAAGATAGATCGCCTCATTGATGGCCCGGGCCCAGGCCAG 

CACCTTCTGCAGCTTACAGTGTGGGGAATTAAACAGCTGCAGGGGCCGGGCCCCGGGGGGGA 

AATTTATAAAAGGTGGATCATTCTGGGTCTGAACAAGATCGTCCGCATGTATGGCCCTGGACC 

CGGACAGGGGCAGATGGTCCACCAAGCAATCAGCCCTCGAACCTTGAATGGACCGGGCCCAG 

GAATCAAGCAATTCATTAACATGTGGCAAGAAGTTGGTAAGGCTATGTACGGTCCCGGCCCTG 

GATGGGCAGGGATAAAACAGGAGTTTGGAATCCCTTACAATCCCCAGGGTCCTGGGCCAGGT 

AAAACGGCAGTGCAGATGGCCGTGTTCATTCATAATTTTAAGCGGGGCCCTGGACCT 

CCAGCTATATTTCAAAGTTCGATGACCAAAATCTTGGAGCCCGGCCCAGGGCCGGGCGAAGT 

GAACATTGTCACAGATTCTCAGTATGCCCTCGGCATCATAGGGCCCGGACCAGGGCATTCCAA 

TTGGCGCGCCATGGCGTCTGACTTTAATCTACCTCCTGGGCCAGGCCCTGGCGCGGAAAC^ 

CTATGTGGACGGCGCTGCAAACAGGGAGACTAAGGGACCCGGACCCGGCGGCGCTGTAGTCA 

TTCAGGACAACTCAGACATCAAGGTGGTTCCCGGTCCAGGCCCCGGGTTCAGAAAGTATACCG 

CCTTCACTATTCCGTCCATCAAC^ 

GGACCCTGAAGGCTGCCGCTTGA 

HIV 75mer 

EKVTLAWWAHKGIGGPGPGQGQMVHQMSPRTL^ 
FTEPSINNE 

GAGAAGGTGTACCTGGCCTGGGTGCCTGCCCACAAGGGAATCGGAGGACCTGGCCCTGGACA 
GGGACAGATGGTGCACCAGGCCATCAGCCCTAGGACCCTGAACGGACCTGGACCTGGAAGCC 
CTGCCATCTTCCAGAGCAGCATGACCAAGATCCTGGAGCCCGGACCTGGACCTGGATTCAGGA 
AGTACACCGCCTTCACCATCCCCAGCATCAACAACGAGTGA 
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PfHTL 

MQVQIQSIJ?LLLLWWGSRGRHNWVNH^ 
KYKLATSVLAGLLGPGPGQTNFKSLLRNL^ 

mkavcvegpgpgmnyygkqew 

klngpgpgllifhingkiiknsegpgpgagllgnvstvllggvgpgpgkykiaggiagglallgpgp 

GMRKLAILSVSSFLFV 



ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 

GGAAGGCACAACTGGGTGAATCATGCTGTGCCCCTGGCTATGAAGCTGATCGGCCCTGGACC 

AGGGAAATGCAACCTCTACGCAGACAGCGCCTGGGAGAACGTCAAGAATGGCCCCGGACCTG 

GGAAATCCAAGTATAAGCTCGCTACCTCTGTGCTGGCAGGCCTGCTCGGACCAGGCCCCGGAC 

AGACAAATTTCAAAAGCCTGCTCAGAAACCTGGGAGTGTCCGAGGGGCCTGGCCCAGGATCT 

AGCGTCirTAATGTGGTCAACTCCTCTATTGGGCTCATCATGGGACCCGGACCTGGGGTGAAA 

AATGTCATTGGCCCATTCATGAAGGCCGTGTGTGTCGAAGGACCCGGGCCTGGCATGAACTAC 

TATGGAAAGCAAGAAAATTGGTACAGCCTGAAGAAAGGCCCTGGGCCAGGCGGACTGGCTTA 

CAAGTITGTGGTCCCAGGGGCAGCCACTCCCTATGGGCCTGGGCCAGGCCCCGATTCCATCCA 

GGACTCTCTCAAAGAGAGCCGGAAACTGAACGGACCC^ 

CAATGGCAAAATTATCAAGAACAGCGAGGGACCTGGGCCAGGCGCCGGACTGCTGGGGAACG 
TGTCCACCGTCCTGCTCGGCGGAGTGGGGCCCGGCCCTGGGAAGTACAAGATCGCTGGAGGG 
ATCGCAGGCGGACTGGCCCTCCTGGGCCCAGGACCAGGGATGCGCAAACTGGCTAT^ 
GTCTCCAGCTTTCTGTTTGTGTGA 
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Protein 



Sequence 



Restriction 



HIV gag 386 
HIV gag 271 
HIVpol774 
HIVpol 448 
HT/poll63 
HIVpol 498 
HIVpol879 
HIVpol 132 
HIVpol772 . 
HIVpol 183 
HIV env 134 
HIV env 651 
HIV env 163 
KVnef221 
HIV vpr 59 
HIV vpr 62 
HIVpol929 
HIVpol722 
HIVpol971 
HIVpoI347 
HIVpol98 
HIV env 61 
HIV env 47 
HIVneflOO 
KVvif7 
HIV gag 162 
HIV gag 545 
HIV gag 237 
HIVpoll86 
HIVpol893 
HIV env 259 
HIV env 250 
HIVnef94 
HIV rev 75 
HIVpol 684 
HIV gag 317 
HIVpol 368 
HIVpol 295 
HIVpol 533 
HIVpol 244 
HIVpol 530 
HIVpol 597 
HIV env 681 
HTV env 671 



VLAEAMSQV 

MTNNPPIPV 

MASDFNLPPV 

KLVGKLNWA 

LVGPTPVNI 

ILKEPVHGV 

KAACWWAGI 

KMIGGIGGFI 

RAMASDFNL 

TLNFPISPI 

KLTPLCVTL 

LLQLTVWGI 

SLLNATDIAV 

LTFGWCFKL 

AURILQQL 

RILQQLLFI 

QMAVHHNFK 

KVYLAWVPAHK 

KIQNFRVYYR 

ACFQSSMTK 

VTIKIGGQLK 

TTLFCASDAK 

VTVYYGVPVWK 

QVPLKPMTYK 

VMWWQVDR 

QMVHQAISPR 

YPLASLRSLF 

HPVHAGPIA 

FPISPIETV 

IPYNPQSQGW 

IPIHYCAPA 

CPKVSFEPI 

EPVRPQVPL 

VPLQLPPL 

EVMVTDSQY 

FRDYVDRFY 

VIYQYMDDLY 

VTVLDVGDAY 

IYQEPFKNL 

PYNTPVFAI 

TYQIYQEPF 

YWQATWIPEW 

IWGCSGKLI 

RYLKDQQLL 



HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A2 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-A3 

HLA-B7 

HLA-B7 

HLA-B7 

HLA-B7 

HLA-B7 

HLA-B7 

HLA-B7 

HLA-B7 

HLA-A1 

HLA-A1 

HLA-A1 

HLA-A1 

HLA-A24 

HLA-A24 

HLA-A24 

HLA-A24 

HLA-A24 

HLA-A24 
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Protein 


Sequence 


Restriction 


HIV env 55 


VWKEATTTLF 


HLA-A24 


HIV vpr 46 


IYETYGDTW 


HLA-A24 


HIVvpr 14 


PYNEWTLEL 


HLA-A24 


HIV gag 298 


KRWuTGLNKTVRMY 


HLA-DR 


HIVpol 596 


WEFV3SJTPPLVKLWYQ 


HLA-DR 


HIVpol 956 


QKQITKIQNFRVYYR 


HLA-DR 


HIVpol 712 


KVYLAWVPAHKGIGG 


HLA-DR 


HTV gag 294 


GEIYKRWnLGLNKI 


HLA-DR 


HTV pol 711 


EKVYLAWVPAHKGIG 


HLA-DR 


HTV env 729 


QHLLQLTVWGIKQLQ 


HLA-DR . 


HTV gag 171 


QGQMVHQAI5PRTLN 


HLA-DR 


HIVpol 335 


SPAIFQSSMTKILEP 


HLA-DR 


HTV env 566 


IKQFINMWQEVGKAMY 


HLA-DR 


HIVpol 303 


FRKYTAF1TPSINNE 


HLA-DR 


HIVpol 758 


HSNWRAMASDFNLPP 


HLA-DR 


HIV pol 915 


KTAVQMAVFIHNFKR 


HLA-DR 


HIVvpu31 


YRKILRQRKIDRLID 


HLA-DR3 


HIV pol 874 


WAGIKQEFGIPYNPQ 


HLA-DR3 


HIV pol 674 


EVNIVTDSQYALGn 


HLA-DR3 


HTV pol 619 


AETFYVDGAANRETK 


HLA-DR3 


HIV pol 989 


GAWIQDNSDIKWP 


HLA-DR3 


HCVNS41812 


LLFNILGGWV 


HLA-A2 


HCVNS1/E2728 


FLLLADARV 


HLA-A2 


HCVNS4 1590 


YLVAYQATV 


HLA-A2 


HCVNS5 2611 


RIJVFPDLGV 


HLA-A2 


HCV CORE 132 


DLMGYTPLV 


HLA-A2 


HCVNS41920 


WMNRLIAFA 


HLA-A2 


HCV NS4 1666 


VLVGGVLAA 


HLA-A2 


HCV NS4 1769 


HMWNFISGI 


HLA-A2 


HCV NS4 1851 


ILAGYGAGV 


HLA-A2 


HCV CORE 35 


YLLPRRGPRL 


HLA-A2 


HCVNS1/E2726 


LLFLLLADA 


HLA-A2 


HCV LORF 1131 


YLVTRHADV 


HLA-A2 


HCV CORE 51 


KTSERSQPR 


HLA-A3 


HCV CORE 43 


RLGVRATRK 


HLA-A3 


HCVENV1290 


QLFTFSPRR 


HLA-A3 


HCVNS1/E2 632 


RMYVGGVEHR 


HLA-A3 


HCV NS3 1396 


LIFCH5KKK 


HLA-A3 


HCVNS4 1863 


GVAGALVAFK 


HLA-A3 


HCV NS4 1864 


VAGALVAFK 


HLA-A3 


HCV NS3 1262 


LGFGAYMSK 


HLA-A3 


HCV Core 169 


LPGCSFSEF 


HLA-B7 


HCV NS5 2922 


LSAFSLHSY 


HLA-A1 


HCV NS3 1128 


CTCGSSDLY 


HLA-A1 


HCV NS5 2180 


LTDPSHTTA 


HLA-A1 
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Protein . 

HCV Core 126 
HCVNS31305 
HCV NS4 1765 
HCV NS5 2875 
HCV NS5 2639 
HCV NS4 1765 
P. falciparum SSP2-230 
P. falciparum EXP1-83 
P. falciparum CSP-7 
P. falciparum LSA1-94 
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P. falciparum SSP2-539 
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P. falciparum SSP2-14 
P. falciparum EXP1-80 
P. falciparum EXP1-91 
P. falciparum SSP2-523 
P. falciparum EXP1-10 
P. falciparum LSA1-11 
P. iFalciparum SSP2-126 
P. falciparum CSP-15 
P. falciparum LSA1-1794 
P. falciparum LSA1-9 
P. falciparum SSP2-8 
P. falciparum CSP-394 
P. falciparum EXP1-2 
P. falciparum CSP-344 
P. falciparum LSA1-59 
P. falciparum SSP2-.207 
P. falciparum LSA1-1671 
P. falciparum LSA1-1876 
P. falciparum SSP2-13 
P. falciparum LSA1-1881 
P. falciparum CSP-55 
P. falciparum LSA1-5 
P. falciparum CSP-2 
P. falciparum CSP-53 
P. falciparum CSP-375 
P. falciparum SSP2-61 
P. falciparum SSP2-165 
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Sequence Restriction 

LTCGFADLMGY HLA-A1 

LADGGCSGGAY HLA-A1 

FWAKHMWNP HLA-A24 

RMILMTHFF HLA-A24 

VMGSSYGF HLA-A24 

FWAKHMWNFI HLA-A24 

FMKAVCVEV HLA-A2 

GLLGWSTV HLA-A2 

ILSVSSFLFV HLA-A2 
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GVSENEFLK HLA-A3 
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MRKLAILSVSSFLFV HLA-DR 
MNYYGKQENWYSLKK HLA-DR 

SSVFNWNSSIGLIM HLA-DR 
RHNWVNfiAVPLAMKLI HLA-DR 

PDSIQDSLKESRKLN HLA-DR3 
KCNLYADSAWENVKN HLA-DR3 
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Figure 20A 
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*XRN ~ Cross binding, number of HLA types in the supertype panel of 5 for which significant binding as detected 
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Figure 20D 

HBV-2 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFV^ 

GVGLSRYVARLFLLTRILTISTLPETTVVRRQAFTFSPTYKGi^ 

VFKVGNFTGLYIOjPSDFFPSVKTLWKAGILYKOTSIPOT^ 



ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCC 
TGTGGAAGGCCGGAATCCTGTATAAGGCC^GTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCC 
TAGCGATTTCTTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTG 
GGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAG 
AGACCACCGTGGTGAGGAGGCAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCT 
GCTGGTGCCCTTTGTGAATATCCCTATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGA 
GTGTTTAAGGTGGGAAACTTCACCGGCCTGTATAACCTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGT 
GGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGGGAGCCGCTCTGGTGGTGGACTT 
TTCCCAGTTCAGCAGAAATTCCGCTATCTGCTCCGTGGTGAGGAGAGCTCTGATGCCACTGTATGCCTGTATC 
TGA 

Figure 20E 

HBV-2A 

MGMQVQIQSLFLLLLWPGSRGHTLWKAGIIiYKAKFV^ 

GVGLSRYVARLFLLTRILTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVNIPIPSSWAFKTPARVTGG 
VFKVGNFTGLYNLPSDFFPSVICrLWKAGILYKW^ 

PSLYKKYTSFPWLLNAHPAAMPHLLKAAADLLDTASALYNAAARFSWLSLLVPFNAASWPKFAVPN^ 
ETVTjEYKALSLDVSAAFYGAAEYLVSFGVWGAALMPLYACI 



ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCC 
TGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCC 
TAGCGATTTCTTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTG 
GGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAG 
AGACCACCGTGGTGAGGAGGCAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCT 
GCTGGTGCCCTTTGTGAATATCCCTATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGA 
GTGTTTAAGGTGGGAAACTTCACCGGCCTGTATAAC^ 

GGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCAGAAGGGAGCCGCTCTGGTGGTGGACTT 
TTCCCAGTTCAGCAGAAATAGCGCCATCTGTTCGGTCGTGAGAAGGAAAGCCTGGATGATGTGGTACTGGGGT 
CCTAGTCTGTATAAGAAGTACACCTCATTCCCATGGCTCTTGAATGCCCATCCCGCTGCAATGCCACACCTGC 
TTAAAGCTGCGGCGGATCTGCTGGACACAGCCTCAGCTTTATATAATGCTGCAGCAAGATTCTCCTGGTTGTC 
TCTCTTAGTGCCCTTCAACGCAGCTTCCTGGCCAAAATTTGCCGTTCCGAACCTGAAGCTCACTTTTGGAAGA 
GAGAC^GTACTTGAATACAAAGCACTAAGCCTTGACGTGTCAGCAGCCTTCTACGGAGCAGCAGAATATCTAG 
TATCTTTTGGGGTCTGGGGCGCAGCCCTGATGCCTCTATACGCCTGCATTTGA 
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Figure 20F 

HBV-2B 

MGMQVQIQSLFLLLLWPGSRGHTLWKAGILYKAKFVAAWT^ 
GVGLSRWARkFLLTRILTISTLPETTVVRRQAFTFSPTYK^^ 
5 WCTGNFTGLYNLPSDFFPSVKTLWKAGILYKNVSIPW 

WGLSLDVSAAFYNAAAKYTSFPWLLNAHPAAMPHLLKAA^ 
KAAWMMWYWGPSLYKAAARFSWLSLLVPFGAAALMPLYACI 



ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCC 
TGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCC 

10 TAGCGATTTCTTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTG 
GGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAG 
AGACCACCGTGGTGAGGAGGCAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCT 
GCTGGTGCCCTTTGTGAATATCCCTATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGA 
GTGTTTAAGGTGGGAAACTTCACCGGCCTGTATAACCTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGT 

15 GGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGGGAGCCGCTCTGGTGGTGGACTT 
TTCCCAGTTCAGCAGAAATTCAGCAATTTGTTCGGTGGTGAGAAGAAAGGAATATCTTGTTTCATTTGGCGTC 
TGGGGGCTGTGACTGGATGTAAGTGCGGCATTTTAG^ 

TAAACGCACACCCAGCTGCAATGCCGCATCTACTGAAAGCAGCCGCTGACCTCTTAGACACTGCCTCCGCTCT 
GTACAACTCTTGGCCCAAGTTTGCCGTGCCTAATCTCAAGTTGACCTTCGGTAGAGAGACAGTCTTAGAATAC 
20 AAAGCGGCCTGGATGATGTGGTACTGGGGACCCTCTCTGTATAAAGCCGCTGCAAGGTTCTCCTGGCTTAGCC 
TTCTCGTACCATTCGGAGCAGCTGCCCTAATGCCTTTGTACGCATGCATCTGA 
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Figure 21 A 
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Figure 21D 

HBV-21A 

MGMQVQIQSLFLLLLWVPGSRGSWPKPAVPNLKAAAAKFVAAWTLKAAAKSTLPETTVVRRKHP 
AAHTLWKAGILYKXAFLLTRILTIGALSLDVSAAFTO^ 
5 GGVFKAAEYLVSFGVWGAAAYMDDVVLGVNDLLDT^ 

AICSVVRRKNFLLSLGIHLNIPIPSSWAFKAAWLSLL^^ 
TYK 



ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATCTTGGC 
CTAAATTCGCAGTGCCAAACCTTAAAGCCGCGGCTGCTAAGTTCGTAGCTGCCTGGACACTAAAGGCCGCCGC 

10 TAAGAGCACACTGCCAGAGACCACCGTGGTCCGGCGAAAGCATCCAGCCGCAATGCCCCACTTGCTCAAAGCA 
GCCGCCCAC^CTCTTTGGAAGGCTGGGATATTGTACAAGAAAGCCTTCCTTCTGACCAGGATATTAACTATCG 
GAGCTCTGTCACTCGACGTTTCTGCTGCCTTCTACAACGCGGCGGCAAAATACACTAGCTTTCCATGGCTACT 
CAACG(^GCCGCCAGATTTTCTTGGCTATCACTACTGGTGCCATTTAATGCAGCAACACCTGCTAGAGTGACT 
GGCGGCGTCTTTAAAGCAGCCGAGTACTTGGTGAGCTTTGGCGTCTGGGGTGCAGCGGCATATATGGATGATG 

15 TAGTGTTAGGGGTGAACGACCTCCTGGACACAGCCAGTGCGCTGTACAATGCAGCTGCATTCCCGCATTGCCT 
AGCCTTCAGTTATATGAAAGCAGCAGCCTGGATGATGTGGTACTGGGGACCGTCCCTTTATAAAGCAGCTTCA 
GCAATCTGTTCCGTTGTGAGGAGAAAAAACTTTTTACTCTCCCTCGGTATTCACCTGAACATTCCCATCCCTT 
CCTCATGGGCATTCAAAGCCGCTTGGCTGAGTCTACTCGTACCTTTCGTTAATGCATTTCTGCCCAGCGACTT 
TTTCCCCTCGGTAAAACTGACATTCGGACGCGAAACAGTCCTTGAATATAAGCAGGCCTTCACGTTCTCACCA 

20 ACCTATAAATGA 

Figure 21E 

HBV-21B 

MGMQVQIQSLFLLLLWVPGSRGYMDDVVLGWAAAEYLVSFGVWNDLLDTASALYGAAHTL 
PSDFFPSVKAFPHCLAFSYMKAARFSWLSLLVPFNAASWPKFAVPl^K^ 
25 RKAFLLTRILTINIPIPSSWAFKAAWMMWYWGPSLYKAAATPARVTGGVFKAANFL 
EYKHPAAMPHLLKAASTLPETTVVRRKWLSLLVPFW 
PWLL 



ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATACATGG 
ATGACGTTGTGTTAGGCGTTAATGCAGCCGCAGAATATCTCGTGTCATTCGGCGTCTGGAACGACCTGTTGGA 

30 CACTGCATCTGCTCTGTACGGTGCAGCCCATACCCTGTGGAAGGCCGGAATCCTCTACAAAAAGGCATTCCTA 
CCTAGCGACTTTTTTCCTTGAGTGAAAGCCTTCCCACATTGCCTAGCATTCTCGTATATGAAAGCGGCTAGGT 
TCTCATGGCTTAGTCTTCTAGTACCTTTCAATGCCGCCTCCTGGCCCAAATTCGCCGTACCAAATCTAAAAGC 
GGCCGCGCAGGCCTTTACATTCTCTCCGACTTATAAAAATGCAGCAGCCTCCGCTATTTGTAGCGTCGTGCGC 
CGAAAGGCCTTCCTGCTAACCCGGATTTTGACGATAAACATCCCCATCCCTTCTAGCTGGGCTTTCAAAGCAG 

35 CATGGATGATGTGGTACTGGGGTCCCAGCTTATACAAAGCTGCGGCAACCCCAGCAAGAGTGACAGGGGGCGT 
GTTTAAGGCCGCCAACTTCCTCCTGAGTCTCGGAATACACCTGAACTTAACCTTTGGGAGAGAGACAGTACTG 
GAGTATAAAC^CCCAGCAGCTATGCCGCACOTACTCAAAGCCGCTTCAACACTCCCAGAAACAACTGTAGTGA 
GGAGAAAATGGCTCTCCCTGCTTGTCCCATTTGTCAACGCCGCCGCCGCTAAGTTTGTGGCCGCTTGGACACT 
TAAGGCTGCAGCAAAGTTGTCACTTGATGTTAGTGC^GCGTTOTATAACGCAGCTGCAAAATACACTTCCTTT 

40 CCCTGGCTGCTGTGA 
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Figure 22A 
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Figure 22D 

HBV-30B 

MGMQVQIQSLFLLLLOTPGSRGFLLTRILTINAAASWPKF^ 

QAFTFSPTYKGAAANVSIPWTHKGAAAFLLSLGIHLNIPIPSSWAFKAAALWFHISCLTFKAAAILLLCLIFL 
5 LNAAAYPALMPLYACINAHPAAMPHLLKAAASFCGSPYKAAG 

FPHCIAFSYMKAEYLVSFGVWAALTFGRETVLEYKAAALPSDFFPSV 

ARWMCLRRFIINAARFSWLSLLVPFNAATPARVTGGVFKAAWLSLL^ 

AKVmMWYWGPSLYKAASTLPETTVVRRKLSLDVSAAFY 

ATGGGAATGCAGGTCCAGATACAGAGCTTGTTCCTCCTCCTGCTTTGGGTCCCCGGATCAAGGGGTTTCCTCC 

10 TAACCCGCATCCTGACAATTAACGCCGCAGCCTCCTGGCCAAAATTTGCCGTGCCAAATCTCAAGGCAGCTGC 
ACACACACTATGGAAAGCAGGGATACTGTACAAGAAAGCCGATCTGCTAGACACAGCGTCTGCGTTGTACAAC 
CAGGCTTTTACTTTCTCTCCTACATATAAAGGCGCAGCTGCAAACGTGAGTATCCCTTGGACGCACAAAGGAG 
CCGCTGCCAACTTCTTACTGTCCCTGGGCATCCATCTAAATATCCCTATTCCTTCATCCTGGGCATTTAAAGC 
AGCCGCCTTATGGTTCCACATAAGTTGTCTGACCTTCAAAGCCGCAGCAATCCTGCTCCTTTGCCTCATTTTC 

15 TTACTAAACGCCGCTGCCTATCCAGCTCTTATGCCATTGTACGCATGTATCAACGCCCACCCCGCAGCAATGC 
CCGACCTCCTTAAAGCTGCCGCCAGTTTCTGCGGTTCTCCTTATAAAGCAGCAGGGCTGTCCAGATACGTAGC 
TAGGCTAAACAAGTATACCAGCTTCCCCTGGTTACTTAATTTCCTGCCGTCAGATTTCTTTCCATCAGTTAAG 
GCCTTCCCTCATTGTCTGGCCTTTAGCTACATGAAGGCTGAATATTTGGTATCCTTCGGCGTGTGGAATGCGG 
CACTGACATTTGGAAGGGAGACAGTGCTCGAGTACAAAGCCGCCGCACTACCCTCGGACTTCTTCCCATCGGT 

20 CAAAGCTTA(^TGGACGATGTAGTCCTCGGCGTTAACTTAGTAGTGGACTTTTCTC^TTTTCC^\GAAACGCA 
GCGGCCAGATGGATGTGCCTTCGGCGTTTTATAATAAACGCCGCTCGATTCAGCTGGCTATCACTCCTAGTTC 
GATTTAATGCAGCTACACCCGCACGGGTGAGAGGTGGAGTTTTCAAGGCAGCGTGGCTTTCACT 
ATTTGTGAACTCAGCTATTTGCTCAGTAGTGAGAAGGAAGGCAAAATTCGTCGCTGCCTGGACTCTCAAAGCT 
GCCGCAAAGTGGATGATGTGGTATTGGGGACCGAGCTTGTACAAAGCGGCCTCTACTCTGCCAGAAACTACCG 

25 TAGTGAGAAGAAAACTGAGCCTGGACGTCAGCGCGGCATTCTACTGA 

Figure 22E 

HBV-30C 

MGMQVQIQSLFLLLLWPGSRGFLLSLGIHLNAAAKYTSFPWLLNAAARFSWLSLLVPFNAAFPHCLAFSYM 
AALVVDFSQFSRGAILLLCLIFLI^AAAHTLWKAGIL^ 
30 LVPFWFLLTRILTINIPIPSSWAFKAAAEYLV^ 

PKFAVPNLKAAAS AI CS VVRRKLSLDVS AAF YNAA^^ I PWTHKGAAGLS RYVARLN 

AAASTLPETTVVRRKHPAAMPHLLKAAARWMCLRRFIINASFCGSPYKAAYMDDVVLG 
AATPARVTGGVFKAAALTFGRETVLEYKQAFTFS PTYK 

ATGGGAATGCAGGTGCAAATACAGTCTCTCTTCCTTTTGCTTCTCTGGGTTCCAGGATCACGGGGCTTCT^ 
35 TTAGCTTGGGCATCCACCTAAATGCTGCTGCAAAATACACATCTTTTCCTTGGCTCCTTAATGCCGCCGCTAG 
GTTTTCATGGCTGAGTCTGCTAGTACCTTTCAATGCGGCTTTCCCACATTGCCTAGCTTTTAGCTATATGAAA 
GCTGCTTTAGTCGTGGACTTTTCACAGTTTAGCAGAGGAGCAATCCTGCTGCTATGTCTGATATTCCTTCTAA 
ACGCAG(^GCCCACACACTCTGGAAAGCTGGTATCCTTTACAAGAAAGCCTGGATGATGTGGTATTGGGGACC 
CAGCCTCTACAAAGCATACCCTGCCCTGATGCCACTATACGCATGCATTGGCGCGGCAGCCTGGTTATCCCTT 
40 TTAGTACCGTTTGTC^CTTTCTATTAACCAGAATCCTGACGATTAATATTCCGATCCCAAGTTCCTGGGCAT 
TCAAAGGAGCCGCGGAGTATCTGGTTTCATTTGGC^ 

GTTCCTCCCCTCCGATTTCTTTCCATCGGTGAAAGACCTCCTTGATACCGCGAGCGCTCTGTACAACTCGTGG 
' CCAAAATTCGCAGTTCCAAACCTAAAAGCCGCCGCCAG 
TCGACGTGTCCGCAGCATTTTATAACGCTGCTGCAAAGTTTGTCGCAGCATGGACAT^ 

45 AGCAGCTU^TGTATCAATACCCTGGACCCACAAGGGTGCAGCCGGGCTO 

GCCGCCGCCTCAACACTGCCTGAGACTACTGTCGTGAGACGCAAACACCCTGCCGCAATGCCCCACCTGCTGA 
AAGCAGCCGCACGATGGATGTGCCTCAGAAGATTCATAATAAACGCTTCTTTCTGTGGGTCACCCTACAAAGC 
CGCTTACATGGACGATGTGGTCCTCGGAGTGAATGCCCTCTGGTTCCATATCAGCTGCCTGACATTCAAGGCA 
GCCGCCACCCCCGCTCGTGTGACAGGAGGTGTCTTCAAAGCCGCGGCACTGACTTTCGGTCGGGAAACTGTAT 

50 TGGAATATAAGCAGGCCTTCACATTCTCCCCAACATACAAGTGA 
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Figure 23A 

B7 A2 Al 
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Figure 23C 



HBV-CL 

MQVQIQSLFLLLLWVPGSRGFLiLSLGIHLNAAAKYTSFPWLLNAAARFSWLSLLVPFNAAPPHCLAFSYMKAA 
LWDFSQ F S RGAILLLCL I FLLNAAAHTLWKAG I LYKKAWMMWYWGPSL YKAYPALMPL YAC IGAAAWLSLLV 
5 PFWFLLTRILTINAAAIPIPSSWAFKAAAEYLVSFGVWNLPSDFFPSVKAAAFLPSDFFPSVKAAADLLDTA 
SALYNSWPKFAVPNLKAAASAICSWl^KLSLDVSAAF^ I PWTHKGAAGLS 

RYVARLNAAASTLPETTVVRRKHPAAMPHLLKAAARWMCLRRFIINASFCG 
S CLTFKAAATPARVTGGVFKAAALTFGRETVLEYKQAFTFS PTYK 



ATGGGAATGCAGGTGCAAATACAGTCTCTCTTCCTTTTGCTTCTCTGGGTTCCAGGATCACGGGGCTTCTTGC 

10 TTAGCTTGGGCATCCACCTAAATGCTGCTGCAAAATACACATCTTTTCCTTGGCTCCTTAATGCCGCC 

GTTTTCATGGCTGAGTCTGCTAGTACCTTTCAATGCGGCTTTCCCACATTGCCTAGCTTTTAGCTATATGAAA 
GCTGCTTTAGTCGTGGACTTTTCACAGTTTAGCAGAGGAGCAATCCTGCTGCTATGTCTGATATTCCTTCTAA 
ACGCAGCAGCCCACACACTCTGGAAAGCTGGTATCCTTTACAAGAAAGCCTGGATGATGTGGTATTGGGGACC 
CAGGCTCTACAAAGCATACCCTGCCCTGATGCCACTATACGCATGCATTGGCGCGGCAGCCTGGTTATCCCTT 

15 TTAGTACCGTTTGTCAACTTTCTATTAACCAGAATCCTGACGATTAATGCTGCCGCCATTCCGATCCCAAGTT 
CCTGGGCATTCAAAGCAGCCGCGGAGTATCTGGTTTCATTTGGCGTATGGAACCTGCCAAGCGACTTCTTTCC 
TTCTGTTAAGGCCGCTGCTTTCCTCCCCTCCGATTTCTTTCCATCGGTGAAAGCCGCTGCCGACCTCCTTGAT 
ACCGCGAGCGCTCTGTACAACTCGTGGCCAAAATTCGCAGTTCCAAACCTAAAAGCCGCCGCCAGTGCCATTT 
GTTCCGTGGTAAGGAGAAT^TTATCACTCGACGTGTCCGCAGCATTTTATAACGCTGCTGCAAAGTTTGTCGC 

20 AGCATGGACATTGAAGGCTGCAGCGAAAGCAGCAAATGTATCAATACCCTGGACCCACAAGGGTGCAGCCGGG 
CTGTCTAGGTATGTGGCGAGGCTAAACGCCGCCGCCTCAACACTGCCTGAGACTACTGTCGTGAGACGCAAAC 
ACCCTGCCGCAATGCCCCACCTGCTGAAAGCAGCCGCACGATGGATGTGCCTCAGAAGATTCATAATAAACGC 
TTCTTTCTGTGGGTCACCCTACAAAGCCGCTTACATGGACGATGTGGTCCTCGGAGTGAATGCCCTCTGGTTC 
CATATCAGCTGCCTGACATTCAAGGCAGCCGCCACCCCCGCTCGTGTGACAGGAGGTGTCTTCAAAGCCGCGG 

25 CACTGACTTTCGGTCGGGAAACTGTATTGGAATATAAGCAGGCCTT 
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Figure 24C 
HBV-HTL 

MGTSFVYVPSALNPADGPGPGLCQVFADATPTGWGLGPGPGRHYLHTLWKAGILYKGPGPGPHOT 
WGELMTLAGPGPGESRLWDFSQFSRGNGPGPGPFLLAQFTSAICSWGPGPGLVPFVQWFVGLSPTVGPGPG 
5 LHLYSHPIILGFRKIGPGPGSSNLSWLSLDVSAAFGPGPGLQSLTNLLSSNLSWIiGPGPGAGFFLLTRILTIP 
QSGPGPGVSFGWIRTPPAYRPPNAPIGPGPGVGPLTVNEKRRLKLIGPGPGKQ 
ANWILRGTS FVYVPGPGPGKQAFTFS PTYKAFIiCGPGPGAKFVAAWTLKAAA 



ATGGGAACTTCTTTTGTGTATGTCCCTTCCGCTCTGAACCCAGCAGACGGACCCGGGCCTGGCCTGTGCCAGG 
TCTTCGCCGACGCAACTCCCACAGGGTGGGGGCTGGGGCCAGGACCAGGCAGGCACTACCTGCATACTCTGTG 

• 10' GAAGGCAGGAATCCTCTATA71AGGGCCCGGCCCAGGCCCTCACCACACCGCCCTGAGGCAGGCCATCCTGTGC 
TGGGGGGAGCTCATGACCCTGGCCGGACCTGGACCCGGGGAGAGCAGACTGGTGGTGGATTTCAGCCAATTCA 
GCAGAGGAAACGGACCCGGCCCTGGGCCTTTTCTGCTGGCTCAGTTTACATCTGCTATTTGTTCTGTGGTCGG 
CCCCGGGCCCGGACTCGTGCCTTTCGTGCAGTGGTTCGTGGGACTGTCCCCTACAGTCGGGCCCGGCCCAGGG 
CTGCATCTGTACTCCCACCCAATCATCCTCGGCTTCCGCAAGATTGGACCCGGCCCAGGCTCCAGCAATCTCT 
15 CCTGGCTCTCTCTGGACGTGTCTGCCGCCTTTGGCCCTGGACCAGGCCTGCAAAGCCTGACTAATCTGCTCAG 
CAGCAACCTGTCCTGGCTGGGACCTGGCCCAGGGGCTGGCTTCTTTCTGCTCACCCGGATTCTCACAATTCCC 
CAGTCCGGACCAGGACCAGGAGTCAGTTTCGGGGTGTGGATCAGGACCCCTCCTGCTTATAGACCACCCAATG 
CTCCAATCGGCCCCGGCCCTGGCGTCGGGCCACTGACCGTGAATGAGAAGCGCCGGCTGAAGCTGATCGGCCC 
TGGCCCTGGCAAGCAGTGCTTTCGCAAACTGCCCGTGAACAGACCTATTGATTGGGGCCCCGGCCCTGGAGCA 

20 GCCAACTGGATTCTCAGGGGAACAAGCTTCGTCTACGTGCCCGGGCCCGGACCAGGGAAGCAGGCTTTTACCT 
TCTCTCCCACTTACAAGGCCTTCCTCTGTGGGCCAGGCCCCGGCGCCAAGTTTGTGGCAGCATGGACCCTCAA 
AGCCGCTGCCTGA 



